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LIU,  JIA.  Transmitter-based  Multiple  Access  Interference  Rejection  and  Diversity 
Techniques  for  Code-division  Multiple  Access  Systems.  (Under  the  direction  of  Dr. 
Alexandra  Duel-Hallen) 

For  the  downlink  of  direct  sequence  code-division  multiple  access  (DS-CDMA) 
systems,  transmitter  (Tx)-based  multiple  access  interference  (MAI)  cancellation  techniques, 
termed  multiuser  precoding,  and  Tx-based  diversity  techniques  can  significantly  increase 
system  capacity  while  remaining  low  complexity  at  mobile  stations  (MS).  We  have  proposed 
novel  linear  and  nonlinear  decorrelating  precoding  algorithms.  Hybrid  transmitter  designs 
were  developed  to  combine  MAI  cancellation  and  diversity  techniques.  We  have  also 
analyzed  the  important  duality  between  Tx-based  multiuser  precoding  and  receiver  (Rx)- 
based  multiuser  detection. 

The  class  of  decorrelating  precoding  techniques  is  simple,  efficient  and  satisfies  the 
minimum  mean  square  error  (MMSE)  criterion.  The  nonlinear  decorrelating  Tomlinson- 
Harashima  precoding  (THP)  inherently  outperforms  linear  MAI  cancellation  methods.  For 
frequency-selective  fading  channels,  we  developed  the  THP  with  Pre-RAKE  combiner 
(PreRAKETHP)  and  Multipath  Decorrelating  THP  (MDTHP)  designs.  While  the 
PreRAKETHP  is  the  optimal  THP  design  by  both  ZF  and  MMSE  criteria,  it  is 
computationally  complex  because  the  MAI  cancellation  filters  depend  on  the  instantaneous 
channel  gain  coefficients  and  therefore  need  to  be  updated  frequently.  In  the  MDTHP,  the 
precoding  filter  is  independent  of  the  channel  state  infonnation  (CSI).  The  MDTHP  is 
simpler  than  PreRAKETHP  at  the  expense  of  moderate  perfonnance  loss. 
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For  Pulse  Amplitude  Modulated  (PAM)  and  Quadrature  Amplitude  Modulated 
(QAM)  systems  with  small  constellation  index,  the  practical  performance  of  THP  is  degraded 
due  to  the  side  effects  of  modulo  operation.  In  contrast,  linear  precoding  is  not  influenced  by 
modulation  and  is  simpler  to  implement.  We  have  developed  several  linear  precoding 
techniques.  The  PreRAKE  Linear  Decorrelating  Precoding  (PreRAKELDP)  is  the  optimal 
ZF  and  MMSE  precoding  method.  The  Multipath  Decorrelating  Precoding  (MDP)  provides  a 
simpler  but  suboptimal  scheme.  It  is  shown  that  the  PreRAKELDP  and  MDP  outperform  the 
existing  linear  precoding  techniques  with  similar  complexity.  The  system  performance  is 
further  improved  by  employing  multiple  antennas  in  transmitter  design. 

Both  multiuser  precoding  and  transmit  diversity  techniques  require  the  knowledge  of 
CSI  at  transmitter.  For  rapidly  varying  channels,  the  long  range  channel  prediction  (LRP) 
provides  the  accurate  CSI  in  time.  We  have  demonstrated  that  the  LRP  method  enables  the 
proposed  Tx-based  techniques  for  practical  systems. 
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Chapter  1 


INTRODUCTION 

1.1  Background 

Wireless  communication  is  dramatically  changing  our  lives.  The  ability  to 
communicate  anytime  and  anywhere  increases  our  quality  of  lives  and  improves  our  business 
productivity.  In  cellular  wireless  systems,  the  network  consists  of  numerous  mobile  users 
communicating  with  one  or  multiple  base  stations  (BS)  that  are  interconnected  with  a  mobile 
telephony  switching  office.  Originated  from  the  spread-spectrum  techniques,  the  code¬ 
division  multiple  access  (CDMA)  systems  can  support  simultaneous  digital  communication 
among  a  large  community  of  relatively  uncoordinated  users.  CDMA  exhibits  potential 
capacity  increase  over  the  conventional  time-division  multiple  access  (TDMA)  and 
frequency-division  multiple  access  (FDMA),  because  CDMA  capacity  is  only  interference 
limited,  while  TDMA  and  FDMA  are  primarily  bandwidth  limited  [GJP91].  The  inherent 
frequency  diversity  of  wideband  signals  enables  CDMA  to  efficiently  suppress  the  narrow- 
band  interference  in  radio  link.  Direct  sequence  (DS)  CDMA  has  been  widely  applied  in  the 
second  and  third  generations  cellular  standards  (IS-95,  WCDMA  and  CDMA2000).  It  is  also 
a  very  promising  option  for  the  next  generation  wireless  communications. 

In  practical  wireless  channels,  the  multiple  access  interference  (MAI)  is  a  major 
limitation  to  the  performance  of  DS-CDMA  systems.  Due  to  multiple  reflections  in  multipath 
fading  channels  and  asynchronous  transmission,  the  received  signal  contains  delayed, 
distorted  replicas  of  the  original  transmitted  signal.  For  each  user,  the  MAI  includes  the 
interference  from  other  users  and  self-interference.  Over  the  past  decade,  various  MAI 
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rejection  techniques  have  been  developed.  This  research  has  primarily  focused  on  receiver 
(Rx)-based  multiuser  detection  (MUD)  [Ver98]  that  results  in  a  complex  receiver  while  the 
transmitter  remains  simple.  Thus,  MUD  techniques  are  mostly  suitable  for  the  uplink.  For  the 
downlink  CDMA  channel,  the  requirements  of  small-size  low-power  mobile  station  (MS) 
have  motivated  the  development  of  Tx-based  MAI  pre-rejection  techniques  in  BS,  termed 
multiuser  precoding.  In  addition  to  Tx-based  MAI  cancellation,  Tx-based  diversity 
techniques  also  efficiently  improve  system  capacity.  For  frequency-selective  fading  channels, 
to  simplify  the  mobile  user  receivers,  the  transmitter  (Tx)-based  PreRAKE  combining  is 
more  suitable  for  the  downlink  than  the  conventional  RAKE  receiver  [EN95].  This  technique 
can  utilize  multipath  diversity  as  efficiently  as  RAKE  receiver.  The  Tx-based  antenna  array 
is  another  efficient  diversity  technique  to  increase  the  received  signal  to  noise  ratio  [BZP04, 
GD05],  In  the  practical  design  of  multiuser  precoding  techniques,  the  problems  of  MAI 
rejection,  transmit  power  control  and  diversity  strategies  need  to  be  considered 
comprehensively. 

The  class  of  decorrelating  zero-forcing  (ZF)-based  precoding  techniques  is  simple, 
efficient  and  satisfies  the  minimum  mean  square  error  (MMSE)  criterion  [VJ98,  BDOO],  thus 
is  most  promising  in  practical  applications.  Inspired  by  Tomlinson-Harashima  equalization 
[LM94],  the  nonlinear  method  of  Tomlinson  Harashima  Precoding  (THP)  proposed  in 
[WFVH04]  utilizes  a  feedback  (FB)  loop  and  a  feed-forward  (FF)  filter  to  jointly  cancel 
MAI.  An  independent  work  on  THP  [LD03]  presented  the  significant  duality  between  THP 
and  the  Decision-Feedback  (DF)  MUD  [Due93].  Both  [WFVH04,  LD03]  assume  that  the 
non-orthogonal  effective  spreading  codes  on  the  downlink  arise  due  to  multipath  fading. 
However,  the  channel  models  employed  in  these  papers  do  not  reflect  the  frequency-selective 
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fading  channel  environment,  and  can  only  be  viewed  as  flat  fading  non-orthogonal  CDMA 
channels.  The  THP  designs  for  frequency-selective  fading  channels  are  presented  in  [LD04, 
LDg04,  LDj04].  In  this  thesis,  we  first  illustrate  the  principle  of  THP  for  the  simple  case  of 
single-path  channels  with  additive  white  Gaussian  noise  (AWGN)  and  flat  Rayleigh  fading 
channels.  Then  we  develop  two  specific  THP  designs  for  frequency-selective  fading 
channels,  THP  with  PreRAKE  combiner  (PreRAKETHP)  and  Multipath  Decorrelating  THP 
(MDTHP).  The  PreRAKETHP  and  MDTHP  incorporate  Tx-based  diversity  combining 
techniques  in  different  ways.  In  PreRAKETHP,  the  MAI  cancellation  is  followed  by  the  pre- 
RAKE  combining  [EN95].  While  this  precoder  is  the  optimal  THP  design  for  multipath 
channels,  it  requires  high  computational  complexity,  since  its  MAI  cancellation  filters 
depend  on  the  rapidly  time  variant  mobile  radio  channel  coefficients  and  need  to  be  updated 
frequently.  In  MDTHP,  the  diversity  combining  is  incorporated  into  the  MAI  cancellation, 
and  the  precoding  filter  is  independent  of  the  channel.  Thus,  MDTHP  is  simpler  than 
PreRAKETHP,  and  results  in  moderate  bit  error  rate  (BER)  loss. 

Several  MAI  cancellation  techniques  used  in  transmitter  for  the  downlink  of  CDMA 
systems  have  analogous  structure  and  similar  performance  to  those  employed  in  the  receiver 
for  the  uplink.  We  address  this  duality  and  show  that  THP  outperforms  previously  proposed 
Tx-  and  Rx-based  linear  and  nonlinear  decorrelating  methods  in  [VJ98,  BDOO,  TC94,  Gun03, 
GDv03,  GDc03,  HS94,  SK97,  ZB96,  Due93].  In  addition  to  CDMA  systems,  THP  is  also  an 
efficient  interference  rejection  approach  for  various  multi-input/multi-output  (MIMO) 
systems,  such  as  orthogonal  frequency  division  multiplexing  (OFDM)  and  multiple-antenna 
channels  [WFVH04].  THP  is  especially  beneficial  as  an  alternative  to  decision-feedback 
receivers  in  coded  systems  [LM94,  ProOl]. 
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In  addition  to  nonlinear  decorrelating  precoding  techniques,  several  linear 
decorrelating  precoding  techniques  are  presented  in  this  thesis.  Compared  to  nonlinear  THP 
precoding,  the  linear  methods  have  lower  complexity  and  higher  error  rate,  but  their 
performance  is  not  influenced  by  modulation  method.  In  the  previously  proposed  linear 
deocrrelating  precoders  [VJ98,  BDOO,  Gun03,  GDv03],  the  global  power  scaling  is 
commonly  used  to  normalize  the  transmit  power.  It  is  shown  that  the  precoding  performance 
is  degraded  by  the  power  scaling.  In  [VJ98],  the  MMSE-based  linear  precoding  with  transmit 
power  constraint  is  also  investigated.  We  propose  two  novel  linear  precoding  methods, 
PreRAKE  Linear  Decorrelating  Precoding  (PreRAKELDP)  and  Multipath  Decorrelating 
Precoding  (MDP)  [LDm05].  In  the  PreRAKELDP  transmitter,  the  preRAKE  combiner  is 
preceded  by  a  linear  precoding  filter.  Based  on  different  transmit  power  control  strategies,  we 
propose  two  solutions  for  the  precoding  filter.  One  is  the  ZF-based  filter  with  individual  user 
power  scaling,  in  which  each  user’s  average  transmit  power  is  nonnalized  by  a  power  scaling 
factor.  The  other  is  the  MMSE-based  filter  under  the  total  transmit  power  constraint.  (Note 
that  if  the  individual-user  transmit  power  constraint  rather  than  the  total  transmit  power 
constraint  is  enforced,  the  MMSE  solution  is  identical  to  the  ZF  solution  [VJ98,  BDOO].) 
When  these  two  PreRAKELDP  solutions  are  compared,  the  ZF  precoder  is  much  simpler 
while  the  MMSE  precoder  has  obviously  better  perfonnance.  It  should  be  noted  that  to 
achieve  a  simple  and  practical  transmitter  design  for  a  system  with  large  user  number,  the 
allocation  of  individual  user  transmit  powers  is  not  taken  into  account  in  the  PreRAKELDP 
designs.  Since  the  pre-RAKE  combiner  is  equivalent  to  a  matched  filter  which  is  matched  to 
the  multipath  fading  channels,  the  PreRAKELDP  with  individual  user  power  scaling  and  that 
with  total  transmit  power  constraint  are  the  optimum  linear  ZF  and  MMSE  precoders, 
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respectively,  under  the  condition  that  the  transmit  power  of  individual  user  is  not  specified. 
In  the  MDP  transmitter,  the  multipath  diversity  combining  is  incorporated  into  the  MAI 
cancellation  process,  and  the  decorrelating  filter  is  independent  of  the  channel  fading 
coefficients  and  only  determined  by  users’  signature  sequences.  As  a  result,  the 
computational  complexity  of  the  MDP  is  lower  than  that  of  the  PreRAKELDP,  Pre-RDD  and 
the  precoders  in  [VJ98,  BDOO], 

Compared  to  the  existing  Tx-based  and  Rx-based  linear  decorrelating  methods,  the 
PreRAKELDP  with  total  power  constraint  has  the  best  performance  and  highest  complexity; 
the  PreRAKELDP  with  individual  power  scaling  has  similar  performance  to  the  Pre-RDD 
and  RDD;  the  MDP  has  identical  error  rate  to  the  Rx-based  Multipath  Decorrelating  Detector 
(MDD)  [LD03],  The  two  proposed  precoders  outperfonn  those  in  [VJ98,  BDOO,  GDv03, 
Gun03], 

A  crucial  assumption  for  Tx-based  interference  cancellation  methods  is  that  the 
transmitter  has  the  knowledge  of  channel  conditions.  The  channel  state  information  (CSI)  can 
be  estimated  at  the  receiver  and  sent  to  the  transmitter  via  a  feedback  channel.  Thus, 
feedback  delay  and  overhead,  processing  delay  and  practical  constraints  on  modulation, 
coding  and  antenna  switching  rates  have  to  be  taken  into  account  in  the  performance  analysis 
of  adaptive  transmission  methods.  For  very  slow  fading  channels,  outdated  CSI  is  sufficient 
for  reliable  adaptive  transmission.  However,  for  faster  fading  that  corresponds  to  realistic 
mobile  speeds,  the  channel  profile  is  quite  different  at  the  time  of  transmission  when 
compared  to  the  outdated  CSI.  As  a  result,  even  small  delay  will  cause  significant 
performance  degradation  due  to  the  channel  variation.  To  enable  Tx-based  signal  processing 
techniques,  the  CSI  at  the  instant  of  transmission  should  be  reliably  predicted,  based  on  the 
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outdated  CSI  feedback  from  the  mobile.  Most  existing  channel  prediction  methods  require 
large  computational  complexity  or  are  only  suitable  for  very  short  range  prediction  [LPL95, 
HM89,  ZS96,  WD95].  The  long-range  channel  prediction  (LRP)  is  a  linear  adaptive 
prediction  technique  proposed  in  [EDH98,  DHHOO].  In  this  thesis,  we  investigate  transmitter 
precoding  aided  by  the  LRP.  The  LRP  algorithm  characterizes  the  fading  channel  using  the 
autoregressive  (AR)  model  and  computes  the  MMSE  estimate  of  a  future  fading  coefficient 
sample  based  on  a  number  of  past  observations.  The  superior  performance  of  this  algorithm 
relative  to  conventional  methods  is  due  to  its  low  sampling  rate.  Given  a  fixed  model  order, 
lower  sampling  rate  results  in  longer  memory  span,  pennitting  prediction  further  in  the 
future.  We  employ  the  LRP  algorithm  to  enable  the  proposed  Tx  diversity  and  precoding 
methods  in  practical  WCDMA  systems. 
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1.2  Outline  of  the  Thesis 


The  goal  of  this  thesis  is  to  investigate  and  compare  the  Tx-based  and  Rx-based 
decorrelating  MAI  cancellation  methods,  with  the  application  of  transmit  diversity  and  long- 
range  channel  prediction.  The  thesis  is  outlined  as  follows: 

Chapter  2  first  describes  the  frequency-selective  channel  model  of  uplink  CDMA,  the 
linear  and  nonlinear  (DF)  decorrelating  multiuser  detection  methods  are  then  presented  and 
their  complexity  and  performance  are  compared. 

From  Chapter  3  to  Chapter  5,  we  concentrate  on  the  downlink  CDMA  channels.  In 
Chapter  3,  we  first  describe  the  centralized  and  decentralized  channel  model  with  single-path 
additive  white  Gaussian  noise  (AWGN)  and  flat  Rayleigh  fading.  The  THP  designs  for  these 
centralized  and  decentralized  channels  are  presented,  respectively. 

Based  on  the  THP  principle  introduced  for  single-path  channels,  we  develop  the  THP 
techniques  for  multipath  channels  in  Chapter  4.  By  combining  the  THP  algorithm  with 
multipath  diversity  in  different  ways,  we  present  the  PreRAKETHP  and  MDTHP  designs  in 
detail.  The  duality  between  these  THP  methods  and  Rx-based  DF  methods  is  analyzed.  We 
also  briefly  review  the  principle  of  LRP.  The  performance  of  THP  aided  by  LRP  is  observed 
in  simulation  experiments. 

In  Chapter  5,  we  focus  on  the  linear  decorrelating  precoding  methods.  The  previously 
proposed  decorrelating  methods  are  first  presented  for  the  purpose  of  comparison.  Then  we 
describe  the  novel  PreRAKELDP  and  MDP.  In  these  derivations,  multiple  transmit  antennas 
are  considered.  These  two  proposed  methods  are  compared  with  the  nonlinear  THP,  linear 
decorrelating  precoders  and  linear  decorrelating  multiuser  detectors.  Finally,  the  significant 
conclusions  of  this  thesis  are  summarized  in  Chapter  6. 
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Chapter  2 


DECORRELATING  MULTIUSER  DETECTION  FOR  UPLINK  CDMA 

2.1  Frequency-selective  Fading  Uplink  CDMA  Channel  Model 

For  the  uplink  of  CDMA  systems,  MS  transmit  signals  towards  BS.  Consider  the 
uplink  of  a  /f-uscr  DS/CDMA  system  with  a  set  of  pre-assigned  normalized  signature 
sequences  y(7),  /  =  1,  2,  K,  where  each  sequence  is  restricted  to  a  symbol  interval  of 
duration  T.  The  data  symbol  for  the  /th  user  in  the  nth  symbol  interval  is  denoted  by  bt{n).  In 
practical  wireless  environment,  the  transmit  signals  are  corrupted  by  fading  and  additive 
noise.  If  the  data  symbol  duration  is  much  smaller  than  the  channel  coherence,  the  channel  is 
slowly  fading,  implying  that  the  channel  characteristics  can  be  estimated  accurately.  If  signal 
bandwidth  is  greater  than  the  channel  coherence  bandwidth,  the  signal  is  subject  to  different 
gains  and  phase  shifts  across  the  band,  thus  the  fading  is  said  to  be  frequency-selective 
[ProOl].  The  practical  CDMA  channels  are  usually  frequency  selective,  due  to  the  large 
bandwidth  of  spread-spectrum  signals.  In  this  case,  the  received  signal  includes  multiple 
versions  of  the  transmitted  waveform  which  are  attenuated  and  delayed  in  time.  The 
frequency-selective  slow  fading  channel  can  be  modeled  by  a  tapped-delay  line  [ProOl]. 
Suppose  there  are  N  resolvable  paths  from  each  MS  to  the  BS.  In  the  nth  transmit  symbol 

interval,  Cjj(n)  =  alj(n)el'p‘J  ")  represents  the  gain  of  the  /th  path  component  over  the  channel 
between  the  /th  user  transmitter  and  the  receiver,  V  i=  1,2,..., A  and  /=  0,1,...,  N-\.  In  the 
special  case  of  Rayleigh  fading,  cy/(n)  is  a  complex  Gaussian  random  process;  equivalently, 
the  magnitudes  au(n)  are  Rayleigh-distributed  and  the  phases  (p,j(n)  are  uniformly 
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distributed.  Due  to  uncorrelated  scattering  assumption,  the  channel  gains  corresponding  to 
different  paths  are  statistically  independent. 

Suppose  K  users  transmit  a  stream  of  2M+1  data  bits  simultaneously,  and  their  signals 
are  received  synchronously  at  the  BS,  then  the  equivalent  low-pass  received  signal  is 

M  K  JV-1 

r{t)  =  X  X  X  Cij(n)bi(n)Si(t-nT-lTc))  +  n(t),  (2.1) 

n=—Mi—  1  /=0 

where  Tc  is  the  chip  duration  and  n(t)  is  a  white  Gaussian  noise  process  with  zero  mean  and 
power  spectrum  density  No.  From  equation  (2.1),  it  is  seen  that  even  if  the  spreading  codes 
are  orthogonal,  due  to  multipath  fading,  in  the  received  signal  there  is  multiple  access 
interference  (MAI),  including  inter-user  interference,  inter-symbol  interference  (ISI)  and 
inter-chip  interference  (ICI).  When  the  chip  interval  Tc  is  much  smaller  than  the  symbol 
interval  T  and  the  multipath  delay  spread  Tm  is  on  the  order  of  a  few  chip  intervals,  the  ISI 
due  to  channel  dispersion  may  be  neglected  [ZB96].  For  synchronous  channels  with  this 
assumption,  (2.1)  can  be  simplified  by  only  considering  a  single  symbol  interval  [0,  T), 

r(t )  =  X  X  Cijbjs,(t-lTc )  +  n(t),  (2.2) 

i=  1  /=o 

Where  c,j  =  cy/(0)  and  h,  =  b,( 0). 

The  data  symbols  of  all  users  can  be  expressed  as  a  vector  b  =  [b\,  £>2,...,  hx]r. 
Suppose  the  data  symbols  are  M- ary  PAM  modulated  with  the  minimum  Euclidean  distance 
2 Ah  i.e.,  bjB  {-(M-\)Ah  -( A/-3 )A ,-,...,( M-3 )A „  (M- 1  )d,},  i  =  1,  2,...,  K.  Define  the  signature 
sequence  vectors  s ,{t)  =  [.v,(/)  s,(t-Tc)  ...  Si(t-(N-\)TC)]T,  i  =  1,2 For  all  the  K  users, 
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T  T  T  T 

define  the  vector  of  delayed  signature  sequences,  s(t)  =  [si(0  S2(0  ...  $dt)  ]  ,  and  the 
correlation  matrix  of  delayed  signature  sequences 

T 

R  =  / s(t)sT(t)dt,  (2.3) 

o 


The  channel  gain  vector  for  the  /th  user  is  defined  as  c,  =  [c,- 0  c,j  . . .  cy.y- 1  ] 7 ;  and  for  K 
users,  define  KN- row,  A'-column  matrix  C  as 


C  =  diag{cj} 


ci  0  0  ...  0 
0  C2  0  ...  0 


L  0  0  . c^J 


(2.4) 


in  which  0  =  {0}jvxi.  The  conjugate  transpose  of  c*  is  c/7  =  [cy0*,  c,-, i*,...,  c;yv- 1*].  Hence,  the 
conjugate  transposed  matrix  of  C  is  CH=  diag{ c/7},  which  has  K  rows  and  KN  columns. 

By  employing  the  D-transfonn  and  spectral  factorization  as  in  [Due95],  the  above 
synchronous  system  model  can  be  extended  to  the  asynchronous  case.  Since  the 
mathematical  model  for  asynchronous  channels  is  in  the  form  similar  to  that  for  synchronous 
channels,  the  MAI  cancellation  approaches  for  asynchronous  channels  can  be  easily  derived 
from  those  for  synchronous  case.  Therefore,  without  loss  of  generality,  we  only  demonstrate 
MAI  cancellation  methods  for  synchronous  channels. 
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2.2  Linear  Decorrelating  Multiuser  Detection  Techniques 


2.2.1  RAKE  Decorrelating  Detector  (RDD) 


Figure  2.1  Decorrelating  Detector  (RDD)  for  a  K-user  iV-channel  paths/user  system 


The  conventional  RAKE  receiver  is  optimal  in  the  absence  of  MAI;  therefore  it  can 
be  employed  for  single-user  systems.  However,  the  RAKE  receiver  suffers  from  near-far 
effects  in  the  presence  of  interfering  signals  received  over  independent  fading  channels 
[KS93].  Even  with  perfect  channel  tracking,  the  near- far  problem  imposes  a  fundamental 
limit  on  the  performance  of  conventional  RAKE  receiver. 

In  order  to  exploit  the  multipath  diversity  offered  by  the  multipath  propagation,  but 
avoid  the  prohibitive  complexity  of  optimal  MLSE  detection,  the  method  of  RAKE 
decorrelating  detector  (RDD)  applies  a  bank  of  conventional  RAKE  combiners,  which  is 
followed  by  a  linear  decorrelating  filter  [HS94].  The  receiver  structure  for  a  /f-uscr  and  N- 
channel  paths/user  system  is  shown  in  Fig.  2.1. 
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Assuming  the  BS  can  perfectly  estimate  the  channel  coefficients,  the  front-end  of  the 
RDD  receiver  consists  of  K  RAKE  combiners.  The  outputs  are  incorporated  into  a  vector, 
v  =  [\'i ,  V2,  ■  •  - ,  Va-]7-  The  outputs  of  the  RAKE  MF  bank  are  sampled  at  the  end  of  each 
symbol  interval,  and  the  resulting  output  vector  y  is  composed  of  K  elements,  yi,  yi,...,  Vk, 
which  are  given  by 

T 

Vk  =  fr(t)ckHSk(t)dt,  k=\,  2,...,  K,  (2.5) 

o 


where  ca  and  s* (t)  follow  the  same  definitions  as  those  in  section  2.1.  For  all  the  K  users,  the 
RAKE  combining  output  vector  can  be  expressed  by 

y  =  Rcb  +  n,  (2.6) 

where  Rc  =  C/7RC  is  a  KxK  matrix.  The  element  on  the  /th  row  yth  column  of  Rc  is 

T 

Rcij  =  fsj(i)CjCjHSj(t)dt,  V  i,j  =  1,  2,..., K.  The  noise  vector  n  is  zero-mean  Gaussian  with 
o 


autocorrelation  matrix  AoRc.  If  y  is  filtered  by  Rc  ,  the  MAI  can  be  cancelled, 

Rc  V  =  b  +  Rc  ‘n.  (2.7) 

For  BPSK  systems,  the  average  transmit  SNR  per  bit  for  the  Mi  user  is  given  by  [ProOl] 

Ebk  A  A- 2 

Ybk  =  jr  =  ~2N~>  k=  1,2 Therefore,  the  theoretical  SER  can  be  expressed  in  terms  of 


Ybh 


Pek  =  Q 


2r, 


bk 


'[Rc  ]a,A 


where  [Rc  1  ]aa  is  the  Alh  diagonal  element  of  matrix  Rc 


(2.8) 
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It  is  illustrated  in  [LV89]  that  the  conventional  decorrelating  detector  for  single  path 
signals  provides  optimum  linear  multiuser  detector  performance  in  the  maximum-likelihood 
sense  when  the  users’  energies  are  unknown.  The  RDD  is  a  linear  detector  which  do  not 

N- 1 

require  knowledge  of  the  user  energies.  If  s,{t)  =  X  Cgs,{t-ITC )  is  regarded  as  the  /'th  user’s 

/=o 

signature  waveform,  the  RDD  has  the  structure  similar  to  the  conventional  decorrelating 
detector;  hence,  the  RDD  should  provide  the  optimum  perfonnance  over  all  linear  multiuser 
detectors  for  multipath  signals  of  unknown  energy. 

2.2.2  Multipath  Decorrelating  Detector  (MDD) 


yi 


b\ 


bK 


Figure  2.2  Multipath  Decorrelating  Detector  (MDD)  for  a  ff-user  /V-channel  paths/user 

system 


13 


Compared  to  RDD,  MDD  is  a  simpler  but  suboptimal  linear  multiuser  detector.  As 
shown  in  Fig.  2.2,  the  front  end  of  the  MDD  detector  consists  of  KN  correlators.  Represent 
the  KN  outputs  of  the  correlators  by  a  vector,  y  =  [n,y2,...,yw]7.  Then  y  is  given  by 

T 

y  =  fr(t)s(t)dt  =  RCb  +  n.  (2.9) 

o 

n  is  the  Gaussian  noise  vector  with  zero  mean  and  autocorrelation  matrix  NoR.  To  eliminate 
MAI,  the  decorrelating  filter  is  defined  as  R  '.  The  decorrelated  result  is 

v  =  R  y  =  Cb  +  nv  (2.10) 

We  know  in  the  vector  v  =  [vi,  V2,  . . .,  vKn]T,  every  N  elements  correspond  to  one  user,  thus  it 
is  convenient  to  define  \k  =  [v(*-pw-i,  V(k-i)N+2,  •••,  vkN]T,  V  k=  1,  2,...,  K,  then 
v  =  [vi  ,  v2  ,. . .,  \k  ]  ■  From  equation  (2.10),  we  have 

\k  =  ckbk  +  nk,  k=  1,2,...,  K,  (2.11) 

in  which  nk  =  [n^-i)N+u  n<k-\)N+2,  •••,  nA  v] '■  The  autocorrelation  matrix  of  nA  is  .W[R  x\k, 
l  is  the  ijth  /Vx/V  block  of  R  1 ).  Due  to  noise  correlation  among  the  A  branches  for  the 
kth  user  at  the  output  of  the  decorrelating  filter,  the  whitening  operation  is  introduced  prior  to 
combining.  The  whitening  filter  for  the  kth  user  D/(  7  is  obtained  from  Cholesky  factorization 
[GV96],  [R-\k  =  D/c7Da,  \/k  =  1,2 The  whitened  result,  denoted  by  vw/(,  is  given  by 

vw/(  =  D kT\k  =  Da  Tckbk  +nwk,  k  =  1 ,  2, . . . ,  K.  (2.12) 

The  whitened  noise  nwA  =  RkTnk  is  zero-mean  and  Gaussian  with  the  autocorrelation  matrix 

Aoljvxjv. 

In  the  scenario  of  whitened  noise,  maximal  ratio  combining  (MRC)  is  the  optimal 
combiner  in  the  sense  of  achieving  the  highest  SNR  at  the  input  of  detection  device  [ProOl]. 
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Although  MRC  is  the  optimal  combiner,  the  noise-whitening  process  in  MDD  re-introduces 
the  self-interference  for  every  user,  since  the  N  branches  of  signal  for  each  user  becomes 
correlated  when  the  noise  components  are  whitened.  This  self-interference  degrades  system 
performance.  Each  element  of  the  combined  result,  u  =  [u\,  in,...,  uk],  which  is  also  the 
decision  variable  for  each  user,  equals 

w*=(D*  /c/i )l!\\wk  =  c//7 ([R  ']*,*)  1  Ckbk  +  c///D/  lnw/(,  k  =  1,2 ,...,K.  (2.13) 
We  can  calculate  that  the  noise  component  for  user  k  is  white  Gaussian  with  power 
WA-([R  '  1 C//7.  Hence,  the  instantaneous  BER  for  user  k  can  be  expressed  in  terms  of  y/,k, 

Pek=Q{^2^\[ RVi),  k=\,2,...,K.  (2.14) 

It  has  been  mathematically  proved  that  the  perfonnance  of  RDD  is  better  than  MDD 
(see  Appendix  of  [HS94]).  On  the  other  hand,  MDD  has  lower  complexity,  since  the  KNxKN 
matrix  inversion  for  MDD  is  not  dependent  on  the  channel  coefficients  and  only  need  to  be 
done  when  the  set  of  active  signature  waveforms  changes;  while  RDD  require  the  inversion 
of  the  KxK  matrix  Rc  during  almost  each  symbol  interval  because  the  matrix  is  dependent  on 
the  channel  conditions. 
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2.3  Decision-Feedback  Decorrelating  Multiuser  Detection  Techniques 

For  single-path  channels,  the  nonlinear  DF  decorrelator  achieves  better  perfonnance 
than  linear  decorrelator  [Due93].  Similarly,  we  will  see  that  in  the  case  of  multipath 
channels,  the  DF  approaches  outperform  the  linear  decorrelators  discussed  above.  In  section 
2.3,  we  will  discuss  the  two  DF  MUDs  which  employ  the  same  MF  structure  as  RDD  and 
MDD,  respectively. 

2.3.1  RAKE  Decorrelating  Decision-Feedback  Receiver  (RDDFR) 


b\ 

b2 


bK 


Figure  2.3  RAKE  Decorrelating  Decision  Feedback  Multiuser  Receiver  (RDDFR)  for  a 
A-user  iV-channel  paths/user  system 


As  we  have  discussed  for  the  method  of  RDD,  if  we  use  RAKE  matched  filters  at  the 
front  end  of  the  receiver,  the  resulting  output  is  given  by  equation  (2.6),  in  which  the 
correlation  matrix  Rc  =  C/;RC  is  symmetric  and  positive  definite.  Therefore,  by  Cholesky 
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factorization,  we  obtain  Rc  =  F/^Fc,  in  which  Fc  =  {fcij}  kxk  is  an  upper  triangular  matrix.  In 
the  proposed  RDDFR  detector,  following  the  RAKE  MF  bank  is  a  noise-whitening  feed¬ 
forward  filter,  as  shown  in  Fig.  2.3.  The  output  vector  of  the  filter  Fc_//  is  equal  to 

y  =  Fc_//y  =  Fcb  +  z,  (2.15) 

where  vector  z  =  [z\,  Z2,...,  zk\ 1  =  Fc  7/n.  As  shown  in  section  2.2.1,  the  autocorrelation 
matrix  of  n  is  AoRc.  Consequently,  the  autocorrelation  matrix  of  z  is  /Vo I,  that  is,  the  noise  is 
whitened  by  the  feed-forward  filter. 

Similar  to  the  DF  MUD  for  single-path  channels,  in  order  to  reduce  the  error 
propagation  effect,  we  assume  the  K  users  are  ordered  according  to  their  transmit  signal 
power,  i.e.,  user  1  is  the  strongest  and  user  K  is  the  weakest.  Define  the  feedback  filter  as 

B  =  Fc-  diag(Fc),  (2.16) 

which  is  a  lower  triangular  matrix  with  all-zero  diagonal.  As  shown  in  Fig.  2.3,  first  the 
symbol  of  user  1  is  estimated.  From  equation  (2.15),  we  know  the  feed-forward  filter  output 

for  user  1  isyi  =  fc\,\b\  +  z\.  Hence,  the  decision  variable  for  user  1  should  be 
Suppose  the  estimated  symbol  corresponding  to  b\  is  denoted  by  b\.  For  user  i,  i  =  2,  3,...,  K, 

i- 1 

the  decision  variable  is /„■/'(  ly-  ^Lfcijbj  T Z/ ) .  If  define  a  vector  b  —  [  b \ ,  b2  b/(] the 

7=1 

decision  device  inputs  for  all  K  users  can  be  expressed  by 

diag(¥cy\y  -  Bb)  =  b  +  diag{ Fc)'1Fc(b-b)  +  diag{ Fc)'‘z  (2.17) 
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It  is  seen  that  the  system  perfonnance  is  degraded  by  error  propagation,  which  is  reflected  by 
the  second  term  in  equation  (2.17).  The  ideal  SER  can  be  obtained  by  assuming  there  is  no 

decision  error,  i.e.,  b  =  b.  For  the  Mi  user,  k  =  1,  2,. . .,  K,  the  ideal  BER  is 

Pek  =  Q(^2fck/yhk).  (2.18) 

Now  we  compare  the  theoretical  performance  of  the  nonlinear  RDDFR  and  that  of  the 
linear  RDD  given  by  equation  (2.8).  Note  that  Rc  H=FC  ^Fc  1  and  therefore 
(Rc_l)tt>  [( Fc  ')id]2=/kk2  ((Rc  ')kk=  [( FcF')u]2  is  satisfied  only  when  k  =  1).  Thus  the  ideal 
RDDFR  outperforms  the  RDD  (the  optimal  decorrelating  linear  MUD)  for  all  users  k  >  1 ;  for 
the  first  user,  these  two  methods  have  the  same  theoretical  performance.  Since  it  has  been 
proved  in  [HS94]  that  the  RDD  outperfonns  the  MDD,  the  RDDFR  also  outperforms  MDD 
theoretically.  With  the  analysis  similar  to  that  for  the  optimality  of  the  RDD,  it  is  easy  to 
conclude  that  the  ideal  RDDFR  is  the  optimal  decorrelating  decision-feedback  MUD. 
However,  the  practical  performance  of  RDDFR  is  degraded  by  the  error  propagation  effect. 
Similar  to  the  DF  MUD  for  AWGN  channels  [Due93],  the  error  propagation  effect  can  be 
mitigated  by  ordering  the  users  from  the  strongest  to  the  weakest. 

2.3.2  Multipath  Decorrelating  Decision-Feedback  Receiver  (MDDFR) 

The  MDDFR  was  proposed  in  [SK97].  In  this  method,  the  KN  correlators  are 
employed  at  the  front  end  of  the  receiver  in  the  same  way  as  for  MDD  in  Fig.  2.4,  the  output 
of  the  correlators  is  given  by  equation  (2.9).  From  the  definition  of  R  in  equation  (2.3),  it  can 
be  observed  that  R  is  symmetric  and  positive  definite.  Thus  by  Cholesky  factorization, 
R  =  FrF,  in  which  F  =  {/i^knxkn  is  a  real  lower  triangular  matrix.  As  shown  in  Fig.  2.4, 
following  the  MF  bank  is  a  linear  filter  F"r,  the  resulting  output  is 
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F_Ty  =  FCb  +  nH, 


(2.19) 


where  nw  =  (F7)  'n  is  a  zero-mean  white  Gaussian  noise  vector. 

Divide  the  NKxNK  matrix  F  into  K 2  square  blocks,  each  block  is  an  NxN  matrix;  and 
represent  the  block  on  the  /th  row  /th  column  by  [F]y-  .  The  maximal-ratio  combining  (MRC) 
is  performed  for  user  1  first.  The  first  N  branches  of  the  whitening  filter  outputs  are  filtered 
by  Ci^Fji/,  thus  the  input  to  the  decision  device  for  user  1  is 

vi  =  Ci^FJu7  [F]ucihi  +  zi,  (2.20) 

A 

where  zi  is  the  noise  component.  The  symbol  decision  for  user  1,  denoted  by  b i,  is  fed  back 
to  the  remaining  users  to  help  to  cancel  the  MAI  due  to  user  1 .  In  this  way,  using  the  channel 
coefficients  (which  are  assumed  to  be  perfectly  known  at  the  base  station)  and  the  decisions 
of  the  previous  k- 1  users,  the  MAI  cancellation  and  MRC  for  user  k,  k  =  2,  3,...,  K,  is 
performed  as 

k-\ 

Vk  =  C//V [ F] /a) ?X[ F]  *,/<:,■(. b,  -  bj )  +  CkHmKk)TmKK*kbk  +  Z/, ,  (2.21) 

(=i 

where  is  the  symbol  decision  corresponding  to  bj.  Similar  to  the  decorrelating  DF  MUD  for 
single-path  channels  [Due93],  to  minimize  error  propagation,  the  K  users  should  be  sorted 
from  the  strongest  to  the  weakest,  i.e.,  the  first  user  should  be  the  one  whose  signal  has  the 
strongest  power,  and  the  last  user  is  the  weakest  one.  By  assuming  no  error  propagation,  we 
can  obtain  the  ideal  SER 

Pek  =  Q(^cklf[F]k/[F]kkcm).  (2.22) 
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The  MDDFR  is  a  suboptimal  MUD  even  if  the  error  propagation  effect  is  ignored.  An 
intuitive  explanation  is  that  the  KN  correlators  at  the  front  end  of  receiver  are  not  the  matched 
filters  perfectly  matched  to  the  multipath  fading  channels.  The  theoretical  performance 
comparison  between  the  MDDFR  and  the  optimal  RDDFR  follows  the  same  derivation  as  in 
the  comparison  between  the  PreRAKETHP  and  MDTHP,  which  is  provided  in  section  4.3.2. 

Although  the  error  rate  of  the  MDDFR  is  higher  than  that  of  RDDFR,  it  should  be 
noted  that  MDDFR  has  the  advantage  of  low  computational  complexity,  since  the  operation 
of  matrix  factorization  is  made  on  R,  which  is  independent  of  channel  state  information. 

The  nonlinear  MDDFR  is  closely  related  to  the  linear  MDD,  noticing  that  they  have 
the  identical  front-end  correlator  bank.  Compare  the  decision  statistic  of  MDTHP, 
C//7[ F] /,/,7  [ F]/,/tc/t,  with  that  of  MDD,  C//7(  [R' 1  ]/./,)" 1 C/,.  Based  on  the  theorem  for  the  inverse  of  a 
partitioned  symmetric  matrix  [HJ85],  it  is  easy  to  prove  that  Ca^FJaa^FJa-a-Ca  >  C//^ [R"1  ]/,/,)" 'ca 
(the  equality  is  satisfied  for  /  =  1).  Therefore,  for  user  1  MDDFR  and  MDD  have  the  same 
performance,  and  for  other  users  MDDFR  outperforms  MDD,  if  the  error  propagation  effect 
is  not  considered. 
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Figure  2.4  Multipath  Decorrelating  Decision  Feedback  Multiuser  Receiver  (MDDFR) 
for  a  3-user  iV-channel  paths/user  system 


21 


2.4  Numerical  Analysis 

We  consider  the  uplink  of  an  8-user,  4  channel  paths/user  BPSK  system.  Suppose  the 
transmit  power  for  all  users  are  equal  and  the  signature  sequences  for  all  users  are  orthogonal 
Hadamard  sequences  with  32-chip  length.  The  user  ordering  for  nonlinear  DF  MUD  is  not 
considered  in  this  example.  In  Fig.  2.5,  the  performance  of  the  dissussed  linear  decorrelating 
methods  RDD  and  MDD  and  nonlinear  decorrelating  RDDFR  and  MDDFR  are  observed. 
The  BER  is  averaged  over  all  users.  The  single  user  bound  (SUB)  and  conventional  RAKE 
receiver  are  also  simulated  as  references.  It  is  obvious  that  the  conventional  RAKE  receiver 
has  poor  performance  in  multiuser  channels.  Although  theorectical  analysis  shows  the 
nonlinear  RDDFR  and  MDDFR  have  better  performance  than  RDD  and  MDD,  respectively, 
the  practical  performance  of  the  nonlinear  methods  is  degraded  by  the  error  propagation.  It 
can  be  observed  that  the  perfonnance  advantage  of  the  nonlinear  methods  relative  to  the 
linear  methods  diminishes  as  transmit  power  increases.  It  is  confirmed  by  this  example  that 
the  optimal  decorrelating  linear  MUD  (RDD)  and  nonlinear  MUD  (RDDFR)  respectively 
outperform  the  subop timal  linear  MUD  (MDD)  and  nonlinear  MUD  (MDDFR). 


22 


Average  BER 


6  8  10  12  14  16  18  20  22 

Transmit  Eb/NO  (dB) 


Figure  2.5  Performance  comparison  of  different  MUD  approaches,  8  users,  4  channel 
paths/user,  BPSK,  equal  transmit  power  for  all  users. 
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Chapter  3 


TOMLINSON-HARASHIMA  MULTIUSER  PRECODING  (THP)  FOR 

SINGLE-PATH  CHANNELS 

3.1  THP  for  Single-path  AWGN  Channels 

3.1.1  Single-path  Centralized  Channels  (CC) 


Figure  3.1  Centralized  Channel  Model 


The  system  diagram  of  CC  model  is  shown  in  Fig.  3.1.  In  the  CC  model,  both  the 
transmitter  and  the  receiver  are  centralized  for  all  users  (or  all  sub-channels  in  multi-channel 
systems),  which  make  it  possible  to  perform  the  Tx-Rx  joint  optimization  with  balanced 
complexity  for  the  transmitter  and  receiver.  The  CC  model  with  MIMO  DF  detector  has  been 
applied  in  multicarrier  [VCLM98],  space-time  [LP02,  ANC01]  and  other  MIMO  systems 
[Due92], 

For  a  A-user  system,  denote  the  transmit  data  symbols  during  the  symbol  interval  of 
interest  [0,  T)  by  a  vector  b  =  (b\,  bi,  ...,  bK)T ■  For  high-speed  downlink  CDMA  channels,  the 
higher-order  modulation  is  favorable  because  the  data  transmission  efficiency  can  be 
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improved  without  increasing  MAI  [TSG01,  BBGOO].  In  this  chapter,  we  use  pulse  amplitude 
modulation  (PAM)  as  an  example  in  derivation,  while  quadrature  amplitude  modulation 
(QAM)  is  also  considered  in  numerical  experiments.  Suppose  the  symbols  are  M- ary  PAM 
modulated  with  the  equivalent  baseband  minimum  Euclidean  distance  2 Ai}  i.e., 
b,e{-(M-l)Ai, -(M-3)Ai,  ( M-3 )A „  ( M- 1  )A , } ,  Vz  =  1,  2,  ...,  K.  Let  the  vector  of 

nonnalized  signature  waveforms  for  K  users  be  s (t)  =  S2(t),  ...,  .sk(i)]1.  For  the  CC 

systems,  the  equivalent  low-pass  received  signal  at  the  centralized  receiver  is 

K 

r(t)  =  X  bjSi(i)+n(t).  In  the  conventional  receiver,  r(t)  is  fed  to  the  filters  matched  to  the  K 

7=1 

T 

users’  signature  waveforms.  The  resulting  output  is  y,=fr(t)si(t)dt,  i  =  1,2,  If  we 

o 

T 

define  the  correlation  between  two  signature  sequences  as  Rij=jsl{t)s]{i)dt, 

o 

V  i,j=  1,  2  ,...,  K,  and  the  KxK  correlation  matrix  R  =  {Rjj},  the  matched  filter  (MF)  output 
vector  y  =  [yi,  y2,  . . . ,  }!k]  T  satisfies 

y  =  Rb  +  n.  (3.1) 

In  equation  (3.1),  n=[«i,  n2,  ...,  nK]r  is  a  zero-mean  Gaussian  noise  vector  with  the  elements 

T 

Hi  =  fn(t)Sj(t)dt,  i  =  1,  2  and  the  autocorrelation  matrix  NoR.  Note  that  we  retain  the 

o 

tenninology  of  CDMA  systems  for  the  CC  model.  Although  its  primary  applications  are  in 
multicarrier,  space-time  and  other  systems,  the  mathematical  models  for  these  applications 
are  equivalent  to  those  used  in  CDMA  systems. 
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3.1.2  Single-path  Decentralized  Channels  (DC) 

As  shown  in  Fig.  3.2,  the  DC  model  corresponds  to  the  downlink  of  CDMA  systems. 
In  this  channel  model,  the  transmitter  is  centralized;  the  receivers  of  different  users  are 
separate,  resource-constrained  and  do  not  have  the  knowledge  of  spreading  codes  of  other 
users.  Therefore,  the  processing  burden  is  placed  in  the  transmitter. 


Noise 


Rx 


Rx 


Rx 


Figure  3.2  Decentralized  Channel  Model 


For  the  DC  model,  e.g.,  the  downlink  of  a  CDMA  system,  the  equivalent  low-pass 

K 

received  signal  at  the  z'th  receiver  is  r,(t)  =  X^a(0+«/(0  ,  where  n,(t)  represents  white 

i=  1 


Gaussian  noise  with  zero  mean  and  power  spectrum  density  No.  In  the  zth  user’s  receiver,  r,(t) 

T 

is  fed  to  the  fdter  matched  to  .s,(/).  The  resulting  output  is  v,=JV,(0sv(0df  Vz  =  1,2,  ...,  A.  The 

o 
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MF  output  has  the  same  form  as  equation  (3.1),  but  for  DC  model  the  autocorrelation  matrix 
of  n  is  No  I.  (I  is  the  KxK  identity  matrix.)  In  practice,  these  channels  are  not  degraded  by 
MAI  due  to  the  orthogonal  signature  sequences  and  synchronous  transmission,  and  thus,  do 
not  require  Tx  precoding.  However,  as  in  [VJ98,  TC94],  we  employ  this  simple  model  with 
non-orthogonal  users’  codes  to  illustrate  the  general  principle  of  multiuser  THP  precoding. 
3.1.3  THP  for  Centralized  Channel  (THP-CC) 

From  equation  (3.1),  we  observe  that  MAI  is  caused  by  the  non-diagonal  elements  of 
matrix  R.  For  CC  systems,  as  well  as  for  the  uplink  of  DS-CDMA,  the  interference  can  be 
conveniently  canceled  by  the  linear  decorrelating  MUD  [Ver98]  The  output  is  R  y  =b+z. 
Since  the  linear  MUD  enhances  noise,  the  nonlinear  DF-MUD  was  proposed  in  [Due93].  The 
positive  definite  symmetric  matrix  R  is  decomposed  by  Cholesky  factorization  [GVL96], 

R  =  FrF,  (3.2) 

where  F  is  a  lower  triangular  matrix.  For  DF-MUD,  a  nonlinear  feedback  loop  and  a  feed¬ 
forward  filter  are  used  to  cancel  the  partial  MAI  expressed  by  F  and  F7  respectively.  This 
approach  avoids  noise  enhancement  and  outperforms  linear  decorrelating  method;  however, 
because  the  feedback  is  based  on  past  decisions,  the  performance  is  degraded  by  error 
propagation. 

Fig.  3.3  shows  the  system  diagram  of  THP-CC.  The  feedback  filter  in  the  transmitter 
is  defined  as 

Bcc=  {Bjj}KxK=  diag( F )  'xF-I,  (3.3) 

where  F  is  as  in  equation  (3.2),  and  diag( F)'1  =  diag{¥)  is  the  diagonal  matrix  that  contains 
the  diagonal  elements  of  F-1.  Thus,  B cc  is  a  lower  triangular  matrix  with  zeros  along  the 
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diagonal.  A  bank  of  mod-2M  operators  are  used  to  limit  the  transmit  power.  For  user  i,  given 


an  arbitrary  real  input  /?,  the  output  of  the  mod-2M  operator  ft  satisfies  [HA ;  =  fd  /At+  2Mc[, 


Decision 


(a)  Transmitter  (b)  Receiver 

Figure  3.3  Diagram  of  THP  for  a  Centralized-Channel  (CC)  System 


where  dt  is  the  integer  to  render  [HA,  within  (-M,  M\.  In  Fig.  3.3(a),  the  output  vector  of  the 
mod-2M  operator  bank  v  =  [vi,  V2,  vK]T  satisfies 

v  =  b-Bccv+2MAd  =  (BcrH)''(b+2A/Ad),  (3 .4) 

where  A  =  diag{Ai}KxK  and  d  =  [d\,  di,  ...,  di[\T  is  an  integer  vector.  For  the  /'th  user, 
i  =  1,  2,  ..., K,  di  is  chosen  to  guarantee  v,-  in  the  range  {-AM,  AM].  Equation  (3.4)  is 
equivalent  to  (Bcc+  I)v  =  b  +  2MAd.  Note  that  (Bcc+  I)  is  a  lower  triangular  matrix  with 
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ones  along  the  diagonal.  Thus,  v\=b\  and  d\=0,  i.e.,  mod-2M  is  not  required  for  user  1;  for  the 
/'th  user,  i  =  2,  3,  we  can  successively  determine  dt  and  v,  based  on  the  values  of  vi, 
V2,...,  Vi- 1, 

+  (3.5) 

j=l 

The  feedback  structure  is  similar  to  that  of  DF-MUD.  However,  because  the  feedback  in  the 
transmitter  is  based  on  the  actual  values  of  user  data  instead  of  past  decisions,  error 
propagation  is  avoided. 

The  received  signal  is  r(t )  =  s T(t)\+n(f).  In  Fig.  3.3(b),  the  output  vector  of  the  MF 
bank,  y  =  [y uy2,  ... ,yK]T ,  is 

y  =  Rv  +  n.  (3.6) 

Given  (3.3)  and  (3.4),  equation  (3.6)  is  equivalent  to 

y  =  F  Tdiag(  F  )(b+2/V/Ad )  +  n.  (3.7) 

To  recover  the  original  data  b,  the  feed-forward  fdter  should  be  defined  as 

Gcc  =  diag(F)~]  F-7,  (3.8) 

Where  F  7  =  (F  l)r=  IF7)-1,  thus  the  output  of  Gcc  is 

Gccy  =  b  +  2MAd  +  zee,  (3-9) 

where  zee  =  Gcc  n.  Equation  (3.9)  shows  that  MAI  has  been  completely  eliminated  at  the 
output  of  the  feed-forward  filter.  Finally,  the  mod-2M  operations  are  performed  for  users  2 
through  K  (mod-2M  is  not  required  for  user  1  because  <ii  =  0).  The  input  vector  to  the 
detector  is  b+zee-  The  autocorrelation  matrix  of  zee  is  Nodiag(¥ )  ,  that  is,  the  noise  is 
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whitened.  The  average  transmit  signal  to  noise  ratio  (SNR)  per  bit  for  the  z'th  user,  /hi.  equals 


[ProOl] 

=  Mu  =  (M2-l)Ar 
/bi~  No  6Aolog2 M  ’  1 


Then  the  theoretical  symbol  error  rate  (SER)  is 


(3.10) 


where  fu  is  the  zth  diagonal  element  of  F.  The  SER  given  by  (3.11)  is  identical  to  that  of  DF- 
MUD  assuming  no  error  propagation.  It  is  observed  that  the  order  of  users  affects 
performance.  For  the  first  user,  the  ideal  perfonnance  of  THP  equals  that  of  the  linear 
decorrelating  MUD;  for  the  last  user,  the  ideal  SER  achieves  the  single  user  bound  (SUB) 
sine  g/kk  =  1. 

Note  that  in  the  transmitter  the  mod-2M  operator  output  v,  is  in  the  range  (-AM, 
A,M |,  which  is  larger  than  the  range  of  original  signal  b,,  [-Ai(M- 1),  At(M- 1)],  i.e.,  the  signal 
power  reduced  by  modulo  operation  is  still  larger  than  the  original  signal  power.  This  power 
penalty  diminishes  as  M  increases  and  can  be  ignored  for  large  M  (M> 8)  [LM94].  It  also 
should  be  noted  the  actual  performance  is  slightly  worse  than  the  ideal  perfonnance  because 
of  the  “end  effect”  of  modulo  operations  in  the  receiver.  The  property  that  the  detection 
enors  for  outer  signals  are  less  likely  than  for  other  signals  is  lost  due  to  the  mod-2M 
operations  in  the  receiver,  because  any  outer  symbol  will  be  re-assigned  a  bounded 
magnitude.  However,  as  observed  in  the  simulation  examples,  the  end  effect  also  diminishes 
as  M  increases  [LM94]. 
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3.1.4  THP  for  Decentralized  Channel  (THP-DC) 

The  scheme  proposed  in  the  previous  section  can  be  applied  in  DC  systems  by  using 
adaptive  linear  processing  in  the  receivers.  However,  it  is  often  desirable  to  make  the 
receivers  of  DC  systems  (e.g.  the  mobile  stations  in  downlink  CDMA)  as  simple  as  possible. 
Therefore,  in  this  section  we  propose  a  different  THP  design  in  which  the  receivers  are  as 
simple  as  in  a  single-user  system.  The  diagram  of  this  design  is  shown  in  Fig.  3.4.  The 
relocation  of  the  feed-forward  filter  from  the  receiver  to  the  transmitter  causes  the  changes  in 
the  structures  of  the  feedback  and  the  feed-forward  filters  denoted  by  B Dc  and  G dc, 
respectively.  The  signal  received  at  the  /th  user’s  receiver  site  can  be  expressed  as 
r,(t)  =  sT(t)Gi)c\+n,(t).  With  transmitter  precoding,  the  output  vector  of  the  scaled  MF  bank  is 
y  =  J/ag(F)'1(RGz)cV  +  n)  =  diagffl 1  [  F  '/FG/X<  B/x+ 1 )" 1  (b+2/V/Ad )  +  n],  (3.12) 

Although  the  scaled  MF  in  the  /th  user  receiver  needs  the  knowledge  of  fa,  this  requirement 
does  not  significantly  increase  receiver  complexity.  To  cancel  MAI,  we  define  B Dc  and  G dc 
as 

BDc  =  diag(F)-'FT-l,  (3.13) 

GDc  =  F"1.  (3.14) 

From  (3.13),  the  feedback  filter  B/x;- is  now  upper  triangular  with  zeros  along  the  diagonal. 

Thus  the  computation  of  the  elements  of  v  is  in  the  reverse  order  relative  to  CC  systems,  i.e., 

vK=  bK  and  Ak=  0  (the  mod-2M  operation  is  not  required  for  the  last  user).  For 

K 

i  =  K- 1,  K- 2,...,  1,  Vi=(bj-  YjBijVj)+2MAjdj.  It  can  be  shown  that  the  feed-forward  filter  given 

j=i+ 1 

by  equation  (3.14)  does  not  increase  the  average  transmit  power  and  the  power  scaling  factor 
is  not  required. 
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Substituting  (3.13)  and  (3.14)  into  (3.12),  we  obtain  y  =  b+2MAd+zDc.  Mod-2M 
operations  are  then  applied  to  eliminate  the  tenn  2MAd.  The  noise  component 
z dc=  diagiF)'  n  is  whitened  since  its  autocorrelation  matrix  is  Nodiag( F)'  .  The  ideal  SER  of 
THP-DC  is  also  given  by  (3.11). 

In  summary,  THP-CC,  THP-DC  and  DF-MUD  have  the  same  ideal  performance 
provided  that  the  effect  of  mod-2M  operation  in  the  THP  schemes  and  the  error  propagation 
in  DF-MUD  are  ignored.  Since  DF-MUD  improves  upon  the  linear  decorrelating  MUD  for 
all  but  the  first  user  [Due93],  the  proposed  THP  methods  also  have  better  perfonnance  than 
the  linear  decorrelating  MUD.  While  this  conclusion  is  valuable  for  CC  systems,  it  is  more 
meaningful  to  compare  THP-DC  with  the  linear  transmitter  precoding  methods.  For  example, 
if  equal  transmit  powers  are  employed  for  all  users  and  all  the  transmit  signals  have  the  same 
cross-correlation,  the  linear  decorrelating  precoding  has  the  same  performance  as  the  linear 
decorrelating  MUD,  and  is  outperformed  by  THP.  The  advantage  of  THP  over  linear 
schemes  will  be  further  demonstrated  in  numerical  experiments. 
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Channel 

(a)  Transmitter  (b)  Receiver 


Figure  3.4  Diagram  of  THP  for  a  Decentralized-Channel  (DC)  System 


33 


3.2  THP  for  Flat  Rayleigh  Fading  Channels 


In  this  section,  we  extend  the  THP  approach  to  flat  Rayleigh  fading  channels.  In  the 
symbol  interval  of  interest,  the  channel  gain  for  the  z'th  user  is  denoted  as  ct  =  a^''’1 ,  where  the 
envelope  a,- has  Rayleigh  distribution  and  the  phase  (f>,  is  uniformly  distributed  over  (-71,  ji\, 
i=  1,2,  . . .,  K.  Define  the  channel  gain  matrix  for  K users  as  C  =  diag{ci)  kxk- 

For  the  DC  system,  the  signal  received  at  the  z'th  user  receiver  is  r,(t)  =  cfir(t)Y+n,(f). 
With  the  THP-DC,  the  output  vector  of  the  MF  bank  equals 

y  =  CRv+n  =  CJz'ag(F)(b+2MAd)+n.  (3.15) 

The  received  signals  are  detected  coherently,  scaled  and  passed  through  the  mod-2M 
operators.  The  input  of  the  detection  device  in  the  z'th  user  receiver  is  afubi  +  z?;.  The  average 
SNR  per  bit  at  the  input  of  the  detection  device  /bi  has  the  mean 


2  ,  2 


Y m  ~  .  ,,  E{(Xj  },  z  1,2, 

6N0  log2  M 


(3.16) 


For  the  z'th  user,  the  average  SER  is  given  by  [ProOl] 


Pe,  = 


M- 1 


M 


1- 


3(log2  M)yb 


'M2  -  l  +  3(log 2M)yi 


bi  J 


(3.17) 


For  the  CC  model  over  the  Rayleigh  fading  channel,  the  received  signal  is 


K 


lit)  =  YuciViSi{t)  +  n(t).  The  MF  bank  in  the  receiver  is  followed  by  the  diagonal  matrix  C  , 

7=1 


resulting  in  the  output  y  =  C^RCv  +  C^n.  Define  F  =  FC,  which  is  also  a  lower  triangular 
matrix,  then  the  feedback  and  feed-forward  filters  are  the  same  as  (3.3)  and  (3.8)  except  that 

F  replaces  F.  The  output  of  the  feed-forward  filter  is  b+2MAd+zCc,  where  z Cc  =  GccC^n  is 
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white  noise  vector  with  auto-correlation  matrix  N odiag{V)~2C~2 .  Thus,  the  mean  of  average 
SNR  per  bit  at  the  detector  input  and  the  average  SER  are  still  given  by  (3.16)  and  (3.17), 
respectively. 

For  the  THP-CC  above,  we  assumed  perfect  knowledge  of  the  channel  coefficients  at 
the  transmitter.  More  generally,  the  channel  coefficients  are  required  for  Tx  precoding  in 
multipath  channels  and  asynchronous  channels.  In  practice,  these  coefficients  need  to  be  fed 
back  and/or  estimated.  For  rapidly  varying  fading  channels,  accurate  long  range  fading 
prediction  is  also  required  [DHHOO].  The  effect  of  quantized  feedback,  estimation  and 
prediction  errors  need  to  be  addressed  in  future  performance  investigation  of  various 
precoding  methods. 
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3.3  Numerical  Results  and  Analysis 

In  this  section,  we  provide  the  numerical  evaluations  of  the  perfonnance  of  THP 
designs  for  single -path  channels.  In  the  first  numerical  experiment,  consider  the  downlink  of 
a  2-user  M-PAM  CDMA  system  over  an  AWGN  channel.  Suppose  the  transmit  powers  of 
the  two  users  are  equal  and  the  two  signature  sequences  have  cross-correlation  R 12  =  0.5.  For 
user  1,  the  theoretical  SERs  of  THP  (both  THP-DC  and  THP-CC),  linear  decorrelating  MUD 
and  linear  decorrelating  precoding  are  equal  in  this  case.  In  Fig.  3.5,  we  demonstrate  the 
difference  between  the  simulation  result  for  THP-DC  and  the  theoretical  SER.  The  gap 
between  the  theoretical  and  the  simulated  SER  is  caused  by  the  “end  effect”  of  mod-2M 
operation  in  receiver  and  the  transmit  power  increase  due  to  mod-2M  operation  in 
transmitter.  The  “end  effect”  diminishes  as  M  or  the  transmit  SNR  increases.  The  transmit 
power  penalty  is  reduced  as  M  increases  and  is  not  related  to  the  original  transmit  SNR.  In 
this  example,  as  M  =  2,  4,  8,  16,  the  corresponding  power  penalty  for  user  1  are  0.67dB, 
0.08dB,  0.03dB  and  0.006dB,  respectively.  In  this  case,  the  performance  degradation  due  to 
mod-2M  operation  is  negligible  for  8-PAM  systems  with  transmit  SNR  larger  than  20dB,  and 
for  16-PAM  systems. 

Then  we  consider  another  example  of  a  heavily  loaded  3-user  8-PAM  system  over  an 
AWGN  channel.  The  cross-correlation  between  any  two  users  is  0.8.  All  users  have  equal 
powers.  Fig.  3.6  shows  the  performance  of  four  methods:  THP-DC,  linear  decorrelating 
transmitter  precoding  (Dp.),  DF-MUD,  and  linear  decorrelating  MUD  Receiver  (Dr.).  The 
performance  of  THP-CC  (not  shown)  is  very  close  to  that  for  THP-DC.  The  only  difference 
is  that  the  performance  degradation  due  to  the  mod-2M  operation  affects  users  1  through  K—  1 
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in  the  THP-DC  system  and  users  2  through  K  in  the  THP-CC  system.  The  SUB  and  the 
conventional  system  performance  are  also  plotted  for  comparison  purposes.  We  notice  that 
the  linear  decorrelating  methods  have  the  same  performance  for  all  users  because  of  equal 
transmit  powers  and  equal  signal  correlations.  For  THP-DC,  the  simulation  results  are  very 
close  to  the  theoretical  performance  since  the  end  effect  of  mod-2M  is  slight  for  8-PAM 
system.  User  3  benefits  the  most  from  using  THP.  The  THP  achieves  the  SUB  and  has  much 
lower  error  rate  than  DF-MUD  and  the  linear  methods.  The  worst  perfonnance  is  for  user  1 
and  is  theoretically  the  same  as  that  of  DF-MUD  for  user  1  and  the  linear  methods  for  all 
users.  For  user  2,  THP-DC  outperfonns  the  linear  methods  and  DF-MUD.  Clearly,  the  THP 
schemes  improve  on  the  linear  schemes  for  most  users.  Although  THP  methods  and  DF- 
MUD  achieve  the  same  performance  theoretically,  the  actual  performance  of  THP  is  better 
than  that  of  DF-MUD  in  this  case  since  the  latter  technique  suffers  from  error  propagation 
when  user  powers  are  similar. 

We  now  investigate  a  4-user,  16-PAM  system  in  a  flat  fading  channel.  Assume  the 
channel  gains  are  i.i.d.  Rayleigh  fading,  and  the  power  of  channel  fading  is  nonnalized.  Let 
the  signal  cross-correlation  between  any  two  users  be  0.8,  and  A\  :  Ai  :  A3  :  Af  =  8:4:2: 1 . 
Thus  the  first  user  is  the  strongest,  and  the  last  user  is  the  weakest.  The  two  linear 
decorrelating  approaches  have  the  same  perfonnance  under  these  given  conditions.  Since 
THP  designs  for  CC  and  DC  systems  have  similar  performance,  we  only  compare  THP-DC 
with  the  linear  decorrelating  methods.  As  shown  in  Fig.  3.7,  user  1  achieves  the  best 
performance  since  its  signal  is  the  strongest.  The  three  approaches  result  in  similar  SER  for 
user  1  and  user  2  as  expected;  while  for  the  weakest  users  (user  3  and  user  4)  it  is  shown  that 
THP-DC  significantly  outperforms  the  linear  decorrelating  schemes.  The  order  of  users 
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presented  here  aids  the  weakest  user.  However,  on  the  downlink,  it  might  be  desirable  to 
order  the  users  in  the  opposite  order,  so  that  a  further  user  which  needs  larger  transmit  power 
is  aided  by  the  THP  most.  Thus,  the  overall  average  and  peak  transmit  power  can  be  reduced. 
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SER  of  User  1 
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Figure  3.5  Symbol  error  rate  for  user  1  of  a  2-user  system  in  AWGN  channel,  M-PAM, 
^1=^2,  ^12  =  0.5. 
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SER 
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Transmit  Eb/NO  (dB) 

Figure  3.6  THP-DC,  DF-MUD,  decorrelating  precoding  and  decorrelating  MUD  in 
AWGN  channels,  3  users,  8-PAM,  A\=A2=A3,  Rn  =  Rn  =  R23  =  0.8. 
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Figure  3.7  THP,  decorrelating  precoding  and  decorrelating  MUD  in  Rayleigh  fading 
channel,  4  users,  16-PAM,  A2:  A2:  A2:  A4  =  8:4:2: 1,  signal  cross¬ 
correlation  0.8. 
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Chapter  4 


TOMLINSON-HARASHIMA  MULTIUSER  PRECODING  (THP)  FOR 

MULTIPATH  FADING  CHANNELS 

4.1  Frequency-selective  Fading  Downlink  CDMA  Channel  Model 

Consider  the  downlink  channel  of  a  A'-user  DS-CDMA  system.  Suppose  the  transmitted 
signals  are  subject  to  frequency  selective  slow  fading  with  N  resolvable  multipath 
components  for  every  user.  By  assuming  that  the  multipath  spread  is  small  relative  to  the 
symbol  duration,  the  inter-symbol  interference  (ISI)  is  minor  and  ignored  in  the  following 

discussions.  For  user  i,  cut  =  «,>e/%  represents  the  gain  of  the  nth  path  component, 
Vi  =  1,2,..., A  and  n  =  0,1,...,  N- 1.  The  received  equivalent  baseband  signal  at  the  zth  mobile 
user  receiver  site  can  be  expressed  by 

K  N- 1 

r,{t)  =  X  X  culbksk(t-n  Tc)  +  n,(t),  (4. 1) 

k=\  n= 0 

where  bk  is  the  data  symbol  for  the  Mi  user  in  the  symbol  interval  [0,7),  silt)  is  the  signature 
sequence  for  the  7th  user,  Tc  is  the  chip  duration,  and  nt{t)  is  complex  white  Gaussian  noise 
with  zero  mean  and  variance  No.  Define  the  channel  gain  vector  for  the  z'th  user 

_ ,c;yv-i],  the  column  vector  of  datasymbols  for  all  K  users  b=[hi,h2,...,hA)r,  and 

the  diagonal  matrix  A=diag{A\^2,- ■  -Ak} ■ 

To  avoid  the  transmit  power  increase  due  to  multipath  Pre-RAKE  combining,  we  use 
the  normalized  fading  channel  coefficients  in  the  transmitter.  For  the  zth  user,  define  the 
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/ 

N- 1 

nomialization  factor  5,  = 

«=0 

V 


ci>n  =  Sj  ci>n  ,  and  the  normalized  vector  corresponding  to  c,  is  c,  =  [c,-,o,  h,  1 , . . . ,  c/,/v- 1  ]  ■  For  the 
convenience  of  derivations,  for  all  K  users  define  a  K- row,  (A'x/Vi-column  channel  gain 
matrix  C  =  diag{cuC2,--;Cx}  and  the  corresponding  normalized  matrix 

C  =  diag{ Ci,  C2,...,Cjf}.  Let  S  =  diag{S\,  S2,...,Sk},  then  C  =  SC. 
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4.2  PreRAKE  Multipath  Diversity  Combining 


b\ 


XC 


1,1 


spread  with 
S\(t-(N-\)TL) 


multipath 

channel 


^Xcuv-i 


spread  with 
s,(X-(N-2)T() 


spread  with 
^i(t) 


Figure  4.1  Diagram  of  PreRAKE  System,  single  user,  A  resolvable  channel  paths 


For  direct-sequence  spread  spectrum  (DS-SS)  communications  (single-user  systems), 
RAKE  receiver  is  known  as  the  optimum  maximum  ratio  combining  detector,  where  signals 
of  the  individual  paths  are  added  in  a  way  to  accentuate  more  credible  signals  and  suppress 
less  credible  ones.  Compared  to  a  single-path  receiver,  the  RAKE  combiner  is  more 
complicated  and  needs  the  instantaneous  channel  information  for  setting  weighing  factors. 
For  the  downlink  channels,  in  order  to  minimize  the  mobile  unit  cost  and  power 
consumption,  PreRAKE  combiner  was  proposed  in  [EN93],  It  concentrates  all  the  processing 
required  for  the  RAKE  combination  at  the  BS  and  keeps  the  mobile  user  receiver  as  simple 
as  a  non-combining  single  path  receiver. 

In  PreRAKE  method,  the  RAKE  combining  process  is  performed  before 
transmission.  In  order  to  determine  the  combiner  parameters,  the  future  channel  gain 
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coefficients  have  to  be  known  by  transmitter.  This  information  could  be  obtained  by  feeding 
it  back  from  the  receivers.  However,  the  inaccuracy  due  to  the  delay  of  feedback  has  to  be 
considered,  especially  when  the  channel  fading  changes  fast.  For  rapidly  varying  fading 
channels,  the  channel  information  can  be  accurately  predicted  by  the  long-range  channel 
prediction  algorithm,  which  is  illustrated  in  section  4.4. 

The  system  diagram  of  a  PreRAKE  transmitter  and  receiver  is  shown  in  Fig.  4.1.  In 
the  transmitter,  the  spread  signal  is  delayed  and  weighed  for  a  number  of  times.  The 
estimated  channel  path  profile  for  future  transmission  is  used  to  set  the  weighing  factors.  To 
illustrate  the  principle  of  PreRAKE  combining,  we  only  consider  the  single-user  system  in 
this  section,  i.e.,  suppose  only  user  1  is  active.  With  PreRAKE  combining,  the  transmitted 
signal  is  given  by 

x(t)  =  Z  c\,n  b\S\(t-N-l+nTc).  (4.2) 

n=0 

After  traveling  through  the  A-path  fading  channel,  the  signal  received  at  the  receiver  is 

n(0  =  Z  Z  chichn*biSi(t-(N-\-n+l)Tc)  +  m(t)  (4.3) 

/=0  n= 0 

At  the  receiver  end,  the  desired  output  of  the  PreRAKE  combining  system  occurs  at  moment 
t  =  {N-\)TC.  The  receiver  is  a  simple  single  path  receiver  which  decodes  only  the  (A-l)th 
peak  of  the  matched  filter  output  for  each  transmitted  symbol, 

T+(N-1)Tc  nana  ^  T 

yi  =  f  r(t)s i (t-(N- 1 ) Tc)dt  =  b iZ  Z  cijc^n  fsi(t)si(t-(n-l)Tc)dt  +  n u  (4.4) 

(N-l)Tc  1=0  «= 0  0 


45 


where  n\  is  the  Gaussian  noise  component  with  zero  mean  and  variance  E{n\n\  }=No.  It  can 
be  observed  from  equation  (4.4)  that  there  is  self-interference  in  the  MF  output  caused  by  the 
multipath.  The  effect  of  self-interference  on  the  PreRAKE  performance  is  analyzed  in 
[ESN99].  If  the  selected  spreading  sequence  has  very  small  auto-correlation,  the  self¬ 
interference  can  be  ignored.  With  this  assumption,  the  received  SNR  is  given  by 

=  V  SNRTx,  (4.5) 

It  can  be  observed  that  the  Tx-based  PreRAKE  combiner  is  equivalent  to  the  Rx- 
based  RAKE  combiner.  By  moving  all  multipath  diversity  combining  operations  to 
transmitter,  the  PreRAKE  efficiently  reduces  the  complexity  of  MS. 


snrRx  = 


Eb 

No 


N- 1 

Z  ClA/ 

n= 0 
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4.3  THP  for  Frequency-selective  Fading  Channels 


4.3.1  PreRAKETHP 


b\ 

b2 


Figure  4.2  Transmitter  of  PreRAKETHP  for  a  2-user  2-channel  path/user  system 


The  transmitter  structure  of  PreRAKETHP  for  a  2-user  2-channel  path/user  system  is 
shown  in  Fig.  4.2.  The  basic  THP  decorrelating  structure  is  similar  to  that  for  the  single-path 
case,  and  is  followed  by  a  pre-RAKE  combiner.  The  FB  and  FF  filters  for  PreRAKETHP  are 
denoted  by  BP  and  Gp,  respectively.  With  the  same  derivation  as  for  single-path  channels,  the 
output  vector  of  the  mod-2M  operator  bank  satisfies  v  =  (BP+I)'1(b+2MAd).  As  we  will  see 
later,  BP  is  a  complex  matrix  related  to  the  channel  fading;  as  a  result,  the  inputs  of  the  mod- 
2M  operators  are  complex  numbers.  The  mod-2M  operations  are  performed  to  constrain  the 
magnitudes  of  the  real  part  and  imaginary  part  of  the  input  number  respectively.  Thus  for  the 
/th  user,  di  is  a  complex  number  with  integral  real  and  imaginary  parts.  As  in  Fig.  4.2,  the 
outputs  of  the  FF  filter  GP  are  multiplied  with  the  conjugates  of  normalized  fading 
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coefficients.  The  output  vector  of  the  pre-RAKE  combiner,  w  =  C  ^GpV,  has  KxN  elements. 
Following  the  signature  sequence  spreading,  the  ultimate  transmitted  signal  is  given  by 

X(t)  =  JLHWkN-lSk(t~ITc)-  (4-6) 

k= i  /=o 

The  equivalent  baseband  received  signal  at  the  zth  user  receiver  site  is  given  by 

rtd)  =  TjTjHci.nwkN-isk(t-ITc -nTc)  +  n,(t)  .  (4.7) 

k= 1  n= 0  1=0 

The  output  of  the  MF  is  sampled  at  t—  (N-1)TC,  and  for  the  zth  user  is  given  by 

T+(N-\)TC 

y,  =  jr,(t)st(t-(N-l)Tc)dt.  (4.8) 

(N-l)Tc 

The  cross-correlations  between  delayed  signature  sequences  can  be  represented  as 

T 

M™k=\Si(t)sk(t  +  mTc)dt,  me{-(N- 1),  1)}.  (4.9) 

0 

From  (4.7),  (4.8)  and  (4.9),  we  have 

K  TV— 1  N-l 

Vi  =  Z  Z  Z  Ci,n  Ma  w(k-]  )N+m+n+\  +  >V  ,  (4.10) 

kF=  1  n= 0  m=-(N- 1) 

where  n,  is  the  filtered  noise  component  with  power  No.  V  /,  ke{\,  2,  ...,  Tf},  define  the  Ax  A 
correlation  matrix  M,;/c  with  the y'th  row  equal  \M]~kJ  ,MfkJ  ,...,Mxk  J  ] ,j=  1,  2 Construct 

the  (TfxA)-row  and  (/txA)-column  matrix  M  by  superimposing  the  KxK  non-overlapping 
submatrices  M#,  i,ke{\,2,  Then  (4.10)  reduces  to 

y  =  CMw  +  n  =  S^CMC  //GP(BP+I)'1(b+2MAd)  +  n,  (4. 1 1) 
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where  y  =  [vi, yi,  ...,  Vk\T  and  n  =  [m,  m,  ...,  hk\T  ■  Define  Rp  =  CMC77  and 

/V  _  /V  /V  JJ 

Rp=  S"RP=  CMC  .  The  matrix  Rp  is  obviously  positive  definite.  Consequently,  it  can  be 
factorized  as  Fp^Fp  using  the  Cholesky  factorization,  where  F p=  {fpij}  k*k  is  complex  lower 
triangular  matrix  with  real  diagonal  elements.  The  normalized  matrix  corresponding  to  Fp  is 

F  ={fij}KxK=  SFp.  To  cancel  the  MAI,  the  FB  and  FF  filters  are  defined  as 

BP  =  JzaglF  y'F"-  I ,  (4.12) 

GP  =  F'1.  (4.13) 

Then  equation  (4.1 1)  can  be  simplified  as 

y  =  dzag(FP)(b+2MAd)  +  n.  (4.14) 

We  assume  the  channel  gains  and,  therefore,  ^p,,-  are  estimated  in  the  receivers.  Then  for  the 
z'th  user,  we  have/pf'y,-  =  b,+  2M4,r/(  +/p„"ln/.  The  tenn  2MA,ch  will  then  be  cancelled  by 
mod-2M  operation.  The  instantaneous  ideal  SER  of  the  z'th  user  is  given  by 

The  PreRAKETHP  described  above  is  related  to  several  precoders  and  MUDs  for 
frequency-selective  CDMA  channels.  It  represents  the  transmitter  precoding  version  of  the 
nonlinear  multiuser  detector  that  consists  of  the  RAKE  receiver  followed  by  the  DF  MUD. 
This  detector  can  be  viewed  as  a  nonlinear  (DF)  version  of  the  RAKE  Decorrelating  Detector 
(RDD)  [HS94],  the  optimum  linear  MUD  for  multipath  channels  (or,  equivalently,  of  the  pre- 
RDD  precoder  in  [Gun03]).  In  this  detector,  the  decorrelating  DF  processing  is  applied  at  the 
output  of  the  filter  bank  matched  to  the  signature  sequences  convolved  with  the  channel 


_  2(M  - 1)  " 
M 
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responses  for  all  users  (the  RAKE  receiver),  while  in  the  RDD,  the  matched  filter  hank  is 
followed  by  the  linear  decorrelating  detection.  Therefore,  this  DF  detector  is  the  optimal 
decorrelating  DF  MUD  for  the  frequency  selective  CDMA  channel.  Since  PreRAKETHP  is 
the  equivalent  Tx-based  implementation  of  this  DF  detector,  it  is  the  optimal  THP  method 
according  to  the  decorrelating  (ZF)  and  MMSE  criteria. 

To  compare  the  nonlinear  PreRAKETHP  with  the  linear  RDD,  first  we  note  that  the 
correlation  matrix  R  in  [HS94]  corresponds  to  Rp  above.  Since  RP  =  (FP  )  FP  ,  (R  )„  in 
equation  (7)  in  [HS94]  is  equal  to  or  larger  than  /p,,"  (equality  is  satisfied  for  i=  1 ). 
Comparing  the  error  rate  formulas  for  PreRAKETHP  (equation  (4.15))  and  RDD  (equation 
(7)  in  [HS94]),  we  observe  that  PreRAKETHP  outperforms  the  RDD  for  all  users  />  1 .  For 
the  first  user,  these  methods  have  the  same  SER.  For  the  last  user,  since 
fvKK  =  [Rp]xx  =  Ck[M\kkC.k" ,  the  performance  of  PreRAKETHP  given  by  equation  (4.15) 
achieves  that  of  the  RAKE  receiver  in  the  absence  of  MAI.  Similarly,  it  can  be  demonstrated 
that  the  PreRAKETHP  has  better  perfonnance  than  the  pre-RDD  in  [Gun03].  Since  it  has 
been  proved  in  [HS94]  that  the  RDD  outperforms  the  Multipath  Decorrelating  Detector 
(MDD)  [ZB96],  the  PreRAKETHP  also  outperforms  the  MDD. 


50 


4.3.2  Multipath  Decorrelating  THP  (MDTHP) 


Figure  4.3  Transmitter  of  MDTHP  for  a  3-user  iV-channel  paths/user  system 


One  drawback  of  the  PreRAKETHP  is  that  the  coefficients  of  the  FB  and  FF  filters 
depend  on  the  channel  gains.  Consequently,  the  matrix  factorization  and  inversion  required 
for  the  computation  of  these  coefficients  have  to  be  performed  frequently,  especially  for 
rapidly  varying  fading  channels.  In  this  section,  we  present  an  alternative  THP  design, 
MDTHP,  to  alleviate  this  problem. 

The  transmitter  diagram  for  the  MDTHP  is  shown  in  Fig.  4.3.  Because  M  is 
symmetric  and  positive  definite  in  practice,  we  can  decompose  M  =  FMrFM  by  Cholesky 
factorization,  where  FM  is  a  lower  triangular  matrix.  Divide  FM  into  KxK  blocks,  where  each 
block  is  an  NxN  submatrix;  and  represent  the  block  on  the  z'th  row  and  /th  column  as  [ F vi],/, 
V  i,  ke{  1,2,  . . .,  K} .  The  FB  loop  in  the  transmitter  works  in  the  following  way.  First,  the  N- 
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dimension  vector  \K  is  computed  by  applying  N  weights  to  the  input  symbol  of  the  last  user, 
\K  =  bK  ■  ([Fm]aa'Ca'//)-  The  interference  caused  by  user  K  is  calculated  from  Va,  and  fed  back 
to  be  canceled  from  the  signals  of  other  users.  This  procedure  is  repeated  consecutively  for 
k  =  K-\,  K-2,  thus  forming  vectors  v*.  For  user  i,  Vz  =  1,2,  1,  the  feedback 


from  user  j,j  =  i+ 1 ,  z+2,  ...,  K,  is  given  by 


c,[FmL 


where  /?,=  c,{FM]/[FM]iiC,tf.  Thus, 


for  users  1  through  K- 1,  the  output  of  the  FB  loop  is 


i  v?  C,  [FM]  V  .  rr  i  "  ' 

hi  -  X  — r~o~o~  +  2MA>d<  '  [Fm  ]" c' 

j=M  JPiP, 


(4.16) 


The  output  power  is  normalized  by  multiplying  v,  by  the  scaling  factor 

Svi  =  (c,[Fm]  a  r[FM]  ifcH  )" 1/2  =  (4.17) 

Let  v,  =  Svl  ■  \i  ,  i=l,2,  Represent  the  input  of  the  FF  fdter  by  the  vector 

v  =  [vi  ,V2  ,...,Va  ]  .  Then  its  output  isw  =  GMV,  where  the  FF  filter  is  defined  as  Gm  = 
Fm  .  Following  the  derivation  similar  to  that  for  the  PreRAKETHP  above,  we  obtain  the 
matched  filter  bank  output  in  the  receivers  as  y  =  CMw  +  n.  For  user  i,  this  output  is 

>7  =  ^ / Sl  ( bi+2MA4i )  +  Hi,  i  =  1 ,  2,  . . .,  K,  (4.18) 


where  S)  = 


\-l/2 

N- 1 

X|cy„|2  .  Consequently,  the  ideal  instantaneous  SER  for  the  zth  user  is 

n=0 

J 


Pd  ( Ybt )  = 


(4.19) 
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The  MDTHP  method  significantly  simplifies  transmitter  precoding  compared  to  the 
PreRAKETHP,  since  it  employs  the  factorization  of  the  channel  gain-independent  matrix  M. 
Thus  even  for  rapidly  varying  mobile  radio  channels  the  matrix  factorization  and  inversion 
operations  do  not  have  to  be  perfonned  frequently.  These  filters  depend  only  on  the  signature 
sequences  and  the  order  of  the  users.  The  MDTHP  is  related  to  several  other  simplified  MUD 
and  transmitter  precoding  methods  that  also  employ  the  matrix  M.  First,  equation  (4.19)  is 
identical  to  the  theoretical  perfonnance  of  the  Multipath  Decorrelating  Decision  Feedback 
Receiver  (MDDFR)  (equation  (16)  in  [SK97]).  However,  since  DF  MUD  is  degraded  by 
error  propagation,  the  MDTHP  has  better  actual  performance  than  the  MDDFR. 

For  the  purpose  of  comparing  the  MDTHP  with  linear  decorrelating  precoders,  in 
[LDj04,  Appendix]  we  described  a  linear  multipath  decorrelating  precoder  closely  related  to 
the  MDTHP.  This  linear  precoder  can  be  regarded  as  the  Tx-based  counterpart  of  the 
Multipath  Decorrelating  Detector  (MDD)  [ZB96].  Therefore  we  call  it  Pre-MDD.  Compare 
the  decision  statistic  of  the  MDTHP,  c^Fm^AFmL/c/7,  with  that  of  the  Pre-MDD  (c.f. 
equation  (A.l)  in  [LDj04]),  c;([.VI_  1  ]„■)” 1 C//7.  Based  on  the  theorem  for  the  inverse  of  a 
partitioned  symmetric  matrix  [HJ85,  p.  1 8],  it  is  easy  to  prove  that 
U[Fm]/77[Fm](/C,//  >  c,([IVI"l]„)"lc//  (the  equality  is  satisfied  for  i  =  1).  Therefore,  for  user  1,  the 
MDTHP,  Pre-MDD  as  well  as  MDD  have  the  same  performance;  and  for  other  users  the 
MDTHP  outperforms  the  Pre-MDD  and  MDD. 

Since/p^2  =  Ca{M]^c//=  Ca'[Fm]^7[Fm]awc//,  by  equations  (4.15)  and  (4.19),  the 
PreRAKETHP  and  MDTHP  have  the  same  performance  for  the  last  user.  Their  SER  is  the 
same  as  for  an  isolated  single  user  with  RAKE  receiver.  We  can  obtain  some  insight  into  the 
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performance  comparison  of  the  two  THP  schemes  for  other  users  by  considering  their 
decision  statistics  averaged  over  the  channel  fading.  Considering  that  the  channel  fading 
along  the  paths  towards  different  user  receivers  are  independent  random  processes,  it  can  be 
derived  from  CMC//  =  Fp//Fp  that  E{\fpij\2}«  E{\fpu\2},  V  tej  (£{■}  is  the  expectation  over 
the  channel  gains),  which  is  also  observed  in  simulation  examples.  Therefore,  we  have 
E{fPj?}  x E{Ci[M\uCjH},  V  i  =  1,  2,  ...,  K  .  On  the  other  hand,  V  &K, 

K 

T  T  ^  rri  T  T  rri  7  T 

c([M];,c(-  =  c,X  [Fm];,  [Fm]/(c,  >c,(FM]ii  [Fm]/,«,-  •  These  arguments  indicate  that  on  average 

j=i 

the  decision  statistics  for  PreRAKETHP,  /P„2,  is  larger  than  that  for  MDTHP, 
c({Fm]/,/[Fm],7c//  .  This  observation  provides  the  theoretical  explanation  for  the  better 
performance  of  the  PreRAKETHP.  This  conclusion  is  further  verified  by  simulation  results. 
On  the  other  hand,  the  MDTHP  is  easier  to  implement  as  discussed  above.  In  summary,  there 
is  a  performance-complexity  tradeoff  between  the  PreRAKETHP  and  MDTHP. 

For  M-PAM/QAM  systems,  the  practical  performance  of  THP  methods  is  degraded 
when  M  is  small  due  to  the  power  penalty  and  end  effect  of  mod-2M  operation  [LM94]. 
Usually  the  first  user  suffers  the  worst  degradation  and  the  last  user  is  not  influenced  at  all. 
Therefore,  to  achieve  more  balanced  performance,  we  sort  users  in  the  order  of  decreasing 
received  powers  in  simulation  examples.  For  larger  values  of  M,  since  THP  methods  are  not 
affected  by  error  propagation,  different  user  ordering  might  be  desirable  in  practice  [LD03], 
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4.4  Long-Range  Channel  Prediction 

To  implement  transmitter-based  MAI  cancellation  and  diversity  techniques,  the  CSI 
must  be  available  at  the  transmitter.  CSI  can  be  estimated  at  the  receiver  and  sent  to  the 
transmitter  via  a  feedback  channel.  Thus  the  feedback  delay,  overhead  and  processing  delay 
have  to  be  taken  into  account  in  the  perfonnance  analysis  of  the  transmitter-based  techniques. 
For  very  slowly  fading  channels,  outdated  CSI  is  sufficient  for  reliable  adaptive  system 
design.  For  faster  fading  which  corresponds  to  realistic  mobile  speeds,  however,  even  a  small 
delay  will  cause  significant  performance  degradation,  because  the  channel  variation  due  to 
large  Doppler  shifts  usually  results  in  a  different  channel  at  the  time  of  transmission  than  at 
the  time  of  channel  estimation.  To  realize  the  potential  of  transmitter  precoding  techniques, 
these  channel  variations  have  to  be  reliably  predicted  at  least  several  milliseconds  ahead. 

The  LRP  algorithm  characterizes  the  fading  channel  using  an  autoregressive  (AR) 
model  and  computes  the  minimum  mean  squared  error  (MMSE)  estimate  of  a  future 
coefficient  sample  based  on  a  number  of  past  observations.  In  contrast  to  conventional 
channel  estimation,  LRP  uses  considerably  lower  sampling  rate  to  forecast  future  values  of 
the  fading  coefficient  far  ahead.  Assume  the  complex  Rayleigh  fading  process  c{t )  is  sampled 
at  the  rate  fs=l/Ts,  which  is  at  least  twice  the  maximum  Doppler  shift  f&m.  The  sample  of 
channel  state  information  (CSI)  is  represented  by  cn=c(nTs).  The  linear  MMSE  prediction  of 
the  future  CSI  sample  c„  based  on  p  previously  observed  CSI  samples  c„.  1,  cn. 2,  . . .,  c„.p  is 

p 

C„  =  YAfn-j  ,  (4.20) 

7=1 

where  dj  are  the  coefficients  of  LP  filter  and  p  is  the  AR  model  order.  The  one-step 
prediction  in  equation  (4.20)  can  be  generalized  to  predict  any  time  z  ahead,  that  is,  we  hope 
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to  find  the  MMSE  estimate  of  a  future  sample  c(r)  by  observing  p  previous  samples  collected 
at  and  prior  to  the  moment  zero. 

p-i 

c( t)  =  cu  =  YAf-j  ,  (4.2 1) 

7=0 

where  r=  uTs  is  the  prediction  range,  v  can  be  any  positive  real  number;  specifically,  when  v 
is  an  integer,  the  prediction  is  called  (7-step  prediction.  The  optimal  coefficients  dj  are 
determined  by  the  orthogonal  principle  as 

d  =  R_1r,  (4.22) 

where  d =[(/0,  d\,  ...,  dp. \ ] T,  R  is  the  pxp  autocorrelation  matrix  with  elements 
Rij=E{c(t+T+iTs)c(t+T+jTs)*}  and  r  is  the  px  1  autocorrelation  vector  with  elements 
ij=E{ c(t)c(t+  r+/Ts)*{ .  For  Rayleigh  fading,  rj=r(  r+/Ts)=Jo(2^fim( r+/Ts))  is  shown  in  Fig.  4.4. 
By  the  solution  of  (4.22),  the  MSE,  E{\e(r)\2}=  E{\c(t)  -  c(z)|2},  is  minimized.  The  resulting 
MMSE  is  given  by 

E{\e(T)\2}  =  \  -tdfj.  (4.23) 

7=0 

Since  the  CSI  samples  are  predicted  at  the  frequency  /s,  it  is  necessary  to  use 
interpolation  to  obtain  the  intermediate  coefficients  of  the  channels.  The  predicted  CSI  values 
are  filtered  by  a  lowpass  interpolating  filter,  so  that  the  CSI  at  the  transmit  data  rate  is 
obtained. 
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Autocorrelation  r(t) 


Time  Delay  fd  t 


Figure  4.4  Autocorrelation  of  Rayleigh  fading  signal,  maximum  Doppler  shift  200Hz,  9 
offset  oscillators  in  Jakes  model. 


4.5  Numerical  Results  and  Analysis 

In  this  section,  we  address  the  multipath  fading  channels  that  result  in  different 
received  powers.  Power  control  is  often  needed  in  these  channels.  Since  the  order  of  users 
influences  the  SER  in  THP  systems,  we  propose  to  select  the  order  that  saves  the  total 
transmit  power.  As  mentioned  in  section  II,  the  practical  performance  of  THP  schemes  for 
M-PAM/QAM  systems  is  degraded  when  Mis  small  due  to  the  power  penalty  and  end  effect 
of  mod-2M  operation.  Usually  the  first  user  suffers  the  worst  degradation  and  the  last  user  is 
not  influenced  at  all.  Moreover,  similarly  to  DF-MUD,  as  the  users’  order  increases,  the 
benefit  of  the  THP  becomes  more  pronounced.  Therefore,  to  achieve  more  balanced 
performance  and  to  minimize  transmit  power,  we  sort  users  in  the  order  of  decreasing 
received  powers  in  the  following  simulation  experiments.  In  practice,  it  might  be  desirable  to 
maintain  constant  user  order  for  an  extended  period  of  time  to  minimize  complexity.  (Note 
that  unlike  DF-MUD,  THP  is  not  affected  by  error  propagation,  so  ordering  users  according 
to  their  powers  is  not  required  for  maintaining  reliable  performance.) 

In  Fig.  4. 5-4. 9  we  investigate  the  perfonnance  of  transmitter  precoding  in  multipath 
fading  environments.  In  these  examples,  we  employ  orthogonal  32-chip  Hadamard  codes  as 
the  signature  sequences.  All  channel  paths  experience  i.i.d.  Rayleigh  fading  and  the  total 
average  channel  power  is  normalized  to  one  for  each  user.  First,  consider  an  8-user,  4- 
channel  paths/user  system.  Fig.  4.5  and  Fig.  4.6  show  the  SER  averaged  over  all  users  for 
BPSK  and  16-QAM,  respectively.  (We  do  not  employ  power  control  in  this  paper,  although 
in  practice  it  is  required  to  maintain  the  target  SER.)  The  transmit  powers  of  all  users  are 
equal.  In  both  figures,  the  THP  methods  significantly  outperform  the  conventional  RAKE 
receiver  and  linear  decorrelating  precoding  with  RAKE  receiver  in  [VJ98]  (labeled  as  “Lin. 
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Prec.  RAKE”).  When  the  bit  error  rate  (BER)  is  lower  than  10'3,  PreRAKETHP  is  the  best 
among  all  the  Tx-based  and  Rx-based,  linear  and  nonlinear  decorrelating  methods,  and 
MDTHP  is  better  than  the  MDDFR,  Pre-MDD  and  MDD,  as  expected  from  theoretical 
analysis.  For  higher  SNR  values,  both  THP  methods  outperform  other  techniques.  This 
confirms  that  the  inherent  advantage  of  nonlinear  THP  methods  overcomes  the  adverse 
influence  of  modulo  operation  even  if  M  is  very  small.  The  linear  precoding  in  [VJ98]  is 
seriously  degraded  by  transmit  power  scaling.  The  SUB  is  also  shown  for  reference.  It  is 
given  by  the  performance  of  isolated  single  user  with  RAKE  receiver. 

In  Fig.  4.7,  the  best  and  poorest  user  performances  of  the  three  precoding  methods 
and  the  optimum  linear  decorrelating  detector  RDD  are  further  compared  with  8-PAM 
modulation.  We  observe  that  the  SER  of  linear  precoding  is  higher  than  those  of  other 
methods,  and  the  lowest  SER  is  achieved  by  the  PreRakeTHP.  As  we  have  analyzed,  the 
PreRakeTHP  and  MDTHP  result  in  the  same  SER  for  the  last  user.  Due  to  the  bit-by-bit  user 
ordering,  the  last  user  experiences  the  worst  instantaneous  fading,  and  thus  its  average  SER  is 
the  highest  among  all  users  in  this  example. 

Next,  consider  a  4-user  3-channel  paths/user  BPSK  system,  where  the  transmit 
powers  for  all  users  are  equal.  In  addition  to  short-tenn  multipath  Rayleigh  fading,  the 
transmitted  signal  is  also  subject  to  long-term  shadow  fading  modeled  by  log-normal 
distribution  with  variance  6dB  [StuOl].  Suppose  the  average  received  signal  powers  for  the 
four  users  satisfy  the  ratio  of  8:4:2: 1.  The  instantaneous  received  power  order  is 
corresponding  to  the  order  of  average  power,  so  bit-by-bit  reordering  is  not  performed.  Fig. 
4.8  demonstrates  the  BER  of  the  weakest  user.  The  proposed  two  nonlinear  techniques  and 
the  linear  techniques  of  Pre-RDD,  Pre-MDD  and  linear  precoder  with  RAKE  receiver  in 
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[VJ98]  (labeled  as  “Lin.  Prec.  RAKE”)  are  compared.  As  expected,  for  the  last  user 
PreRAKETHP  and  MDTHP  result  in  the  same  performance  as  for  the  isolated  single  user 
with  RAKE  receiver  (SUB),  while  other  methods  have  poorer  performance.  Equivalently,  the 
users  that  experience  greater  propagation  loss  will  require  lower  power  to  satisfy  the  target 
BER.  Thus,  in  practice  the  overall  transmit  power  can  be  reduced  when  THP  methods  are 
employed. 

A  critical  assumption  for  the  above  simulation  experiments  is  that  the  channel  state 
information  (CSI)  is  perfectly  known  at  the  transmitter.  Next  we  observe  the  performance  of 
the  THP  techniques  under  the  condition  that  the  CSI  is  estimated  at  the  Rx  and  fed  back  to 
the  Tx.  Consider  a  BPSK  modulated  4-user  3-channel  paths/user  wideband-CDMA  (W- 
CDMA)  system  with  the  following  parameters;  the  carrier  frequency  is  2GHz,  the  maximum 
Doppler  frequency  is  200  Hz  and  the  transmit  data  rate  is  128kbps  [ICM98].  The  frequency 
selective  Rayleigh  fading  is  simulated  by  the  Jakes  model  [Jak93].  In  Fig.  4.9,  the  average 
BER  of  the  proposed  precoding  methods  is  shown  for  three  different  cases:  CSI  is  perfectly 
known  at  the  transmitter;  CSI  is  predicted;  CSI  is  fed  back  to  the  transmitter  (no  prediction  is 
used).  For  CSI  prediction,  the  3-step  LRP  algorithm  for  W-CDMA  [DHH00]  is  utilized  with 
the  channel  sampling  frequency  of  1600Hz  and  the  prediction  range  of  0.625ms  (the  slot 
interval).  Since  the  channel  sampling  and  prediction  frequency  is  less  than  the  data  rate,  it  is 
necessary  to  interpolate  the  intermediate  channel  gain  coefficients.  In  the  case  of  delayed  fed 
back  CSI  (FBCSI),  the  channel  gains  are  fed  back  without  prediction  with  the  feedback 
delays  of  0.3125ms  and  0.625ms.  Note  that  transmitter  precoding  aided  by  channel 
prediction  achieves  almost  the  same  performance  as  for  the  case  when  the  channel  is 
perfectly  known  at  the  Tx,  while  feeding  back  delayed  CSI  without  prediction  results  in 
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significant  perfonnance  degradation.  Thus,  accurate  LRP  is  required  for  reliable  multiuser 
precoders  in  practical  mobile  radio  channels. 


61 


Average  SER 


6  8  10  12  14  16  18  20  22 

Actual  Transmit  Eb/Nfl  (dB) 


Figure  4.5  Performance  comparison  of  various  techniques  in  multipath  fading  channels, 
8  users  with  equal  transmit  powers,  4  channel-paths/user,  BPSK. 
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Figure  4.6  Performance  comparison  of  various  techniques,  8  users  with  equal  transmit 
powers,  4  channel-paths/user,  16-QAM. 
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Lowest  and  H 


Figure  4.7  Best  and  poorest  user  SER  for  8  users,  4  channel-paths/user,  8-PAM. 
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Figure  4.8  BER  of  the  weakest  user  in  large  scale  lognormal  multipath  fading  channels, 
4  users  with  equal  transmit  power,  3  channel  paths/user,  ratio  of  the 
received  signal  powers  8:4:2:1,  BPSK. 
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Figure  4.9  Performance  comparison  of  precoding  aided  by  CSI  prediction  and  by  CSI 
feedback. 
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Chapter  5 


LINEAR  DECORRELATING  MULTIUSER  PRECODING 

In  last  two  chapters,  we  have  investigated  the  nonlinear  THP  technique  for  MAI 
cancellation.  We  find  that  although  THP  can  efficiently  reduce  MAI,  its  performance 
advantage  over  linear  methods  is  not  obvious  for  small-M  systems,  such  as  BPSK  and  QPSK, 
due  to  the  power  penalty  and  end  effect  of  mod-2M  operation.  In  addition,  the  nonlinear  THP 
designs  have  higher  complexity  than  the  existing  linear  decorrelating  precoders.  In  this 
chapter,  we  first  describe  the  previously  proposed  linear  decorrelating  precoding  methods, 
and  then  develop  a  few  novel  linear  precoding  schemes.  The  combination  of  precoding  and 
diversity  techniques  is  discussed  in  detail.  Without  loss  of  generality,  only  BPSK  modulated 
systems  are  considered  in  the  following  derivations. 

5,1  Linear  Decorrelating  Precoding  with  RAKE  Receiver 

In  this  method  [VJ98],  the  traditional  RAKE  receiver  is  employed,  which  acts  as  the 
matched  filter  (MF).  The  notation  for  the  frequency-selective  channel  model  remains  the 
same  as  in  section  4. 1 .  For  the  /th  user,  the  output  of  the  RAKE  receiver  is 

AM  T+lTc  *  AM  AM  K  T 

yt  =  Z  /  Ci/r,{t)s,{t-lTc)dt  =  Z  Z  Z  cUnCijbk J  s,( t)s^ t-n Tc+lTc)dt  +  zu  (5.1) 

1=0  iTc  £=0  «=0£=1  0 


where  z,  is  given  by 


Zi  = 


AM  T 

Z  f  cj ni(t)Si(t-lTc)dt. 


1=0  o 


(5.2) 


Define  m  =  ( n-I ),  therefore  me[-(N- 1),  -(N- 2),...,  N- 2,  N- 1].  Denote  the  integral  in 
equation  (5.1)  as 
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(5.3) 


T 

R'a  =  f s  <(  t)sk(  t~m  T .)  dt, 

o 

which  is  the  cross-correlation  between  signature  waveforms.  Then  the  correlation  matrices 
are  given  by 


and 


Kt 

K 

K  ■ 

D-(ZV-l) 
••  Ki,k 

K 

K 

K  • 

n-(V- 2) 
”  Ki,k 

K 

K 

K  ■ 

n-(V-3) 
”  Ki,k 

K1 

uN- 2 
R,k 

K3  ■ 

■■  Ra 

i,ke{l,2,...,K},  (5.4) 


Ru 

Rl,2 

R2,l 

R22 

R2,ZZ 

R  —  {R/,/t}(jVx.K)x(7VxZO  — 


R 


K,\ 


R 


K,2 


R 


K,K 


(5.5) 


In  order  to  express  equation  (5.1)  in  a  compact  form  for  all  the  K  users,  we  also  define  the 
matrices  of  channel  gains.  For  the  z'th  user,  the  row  vector  of  channel  gains  is 
c f  =  [dfl,  ci i,  . . .,  c,yv-i];  then  the  /f-row  (/VxA'j-column  matrix  C,  is  defined  as 


C,  =  diag{  c,}  = 


c,  0  .  0 

0  c,  0  ...  0 

0  .  0  c, 


i=l,2,...,K,  (5.6) 


in  which  0  =  {0} im.  For  the  zth  user,  define  R(z)  =  C  RC,  7;  and  the  kt h  row  vector  of  R(z)  is 


denoted  by  R(z)a,  V  k  =  1,2 K.  Based  on  these  definitions,  the  matrix  that  combines  the 
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channel  gains  and  cross-correlations  of  signature  sequences  is  given  by 
R,=  [R(  V)\H,  R(2)2//,...,  R(K)kI!]!I,  which  is  a  KxK  matrix.  Represent  the  RAKE  combiner 
outputs  of  all  the  K  users  by  the  vector  y  =  [vi,V2,  ...,Vk]T,  and  the  noise  components  by 
z  =  [zi,Z2,  . . . ,  z/(] T.  Then  for  K  users,  equation  (5.1)  can  be  transformed  into  the  compact 
form 

y  =  Rrb  +  z,  (5.7) 

Equation  (5.7)  is  in  the  similar  form  as  the  output  of  the  matched  filter  bank  for  case  of 
AWGN  channels,  therefore  the  matrix  Rr  can  be  interpreted  as  matrix  of  the  cross¬ 
correlations  between  the  received  user  signal  waveforms  in  multipath  case.  It  is  practically 
reasonable  to  assume  Rr  always  invertible.  Based  on  the  assumption  that  the  transmitter  can 
correctly  predict  all  the  channels,  we  insert  the  linear  filter  T  =  R,'1  into  the  transmitter.  The 
corresponding  RAKE  combiner  output  is  y  =  b+z,  thus  the  interference  among  the  user 
signals  is  eliminated.  Although  from  the  above  derivations  T  =  R,  is  obtained  as  the  zero 
forcing  filter,  it  also  satisfies  the  MMSE  criterion  [VJ98,  Appendix  A].  Similarly  to  the  linear 
decorrelating  precoding  in  AWGN  channels,  the  total  transmit  power  is  increased  as  a 
consequence  of  the  linear  transmitter  precoding.  Without  precoding,  the  total  average 

K 

transmit  energy  per  symbol  interval  for  a  BPSK  system  is  Eav=  YAr~-  With  the  assumption 

i=  1 

that  the  spreading  sequences  for  all  users  are  orthogonal  to  one  another,  the  total  average 

K 

transmit  energy  per  symbol  interval  equals  to  Eav  =  X^A  Rr  R(  Therefore,  the  power¬ 
ful 

scaling  factor  Sf  is  given  by  the  ratio 
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(5.8) 


The  actual  transmit  signal  should  be  x(t)  =  .S’/s7(/)Rr‘lb.  At  the  receivers,  the  symbol 
detector  input  is  SJb  +  z.  From  equation  (5.2),  the  noise  component  for  the  z'th  user  has  zero 
mean  and  variance 


N-lN-l 


var  jz,  }  =  No  Z  Z  R  , <Az<A» 


Hence,  the  BER  for  the  z'th  user,  i  =  1,  2,. . .,  K,  is  given  by 


Pe=0  2srn‘ 

I  ^  N-lN-l 

1  XXK"ci,icl 


(5.10) 


Although  this  precoding  method  completely  eliminates  the  MAI,  it  is  shown  by  equation 
(5.10)  that  the  BER  is  increased  by  the  power  scaling  factor  Sf.  Also,  this  method  does  not 
completely  simplify  the  mobile  user  receiver,  since  it  uses  the  Rx-based  RAKE  combiner. 

5.2  Linear  Decorrelating  Prefilters 

In  the  method  of  linear  decorrelating  transmitter  precoding  with  RAKE  receiver,  the 
linear  filtering  is  applied  directly  to  the  data  symbols  and  followed  by  spreading;  while 
another  way  to  apply  a  linear  filter  in  the  transmitter  is  to  filter  the  transmit  signal  which  has 
already  been  spread,  this  approach  was  named  as  decorrelating  prefilters  [BD00] . 

For  a  K-user  system,  the  data  symbols  of  every  user  is  first  spread  with  its  signature 
sequence,  and  then  passed  through  a  Z^-tap  linear  filter.  For  the  z'th  user,  represent  the 


70 


prefilter  coefficients  by  a  column  vector  p,  =  \pu 0,  pL \ ,  Pi,LP-\ ] ' ,  i=l,2,...,K.  The  final 

transmit  signal  is  given  by 

40  =  X  YjbkPkJsk  ( t-jTc ) .  (5.11) 

k= 1  y=0 

In  the  /th  user  receiver,  the  received  signal  is 

ri (0  =  X  X  TjCuPk,jbksk  0  -  O'  +  00  )  +  ni  (0  .  (5.12) 

/=o  k= l  y=o 

The  received  signal  is  fed  into  the  filter  simply  matched  to  sl{t)  instead  of  the  RAKE 
combiner,  since  in  this  approach  the  multipath  interference  is  aimed  to  be  rejected  in  the 
transmitter,  and  the  mobile  user  receivers  remain  as  simple  as  those  in  single-path  channels. 

k  N~l  Tr 

With  the  notation  aLJ  =  Y,cijjSk(t-(j+l)Tc)si(t)dt,  the  output  of  the  matched  filter  in  the  /th 

/=o  ’  o 


user  receiver  equals 


K  f  Lp-\ 

V,  =  X  hk  Z  Pkjaij 

k= 1  7=0 


+  ni , 


(5.13) 


where  =  J//,(/).s- ,(/)///.  Thus  to  recover  b,  from  y„  the  prefilter  optimization  problem  can  be 


expressed  mathematically  as 


max 

Pui 


r Lp-\  ^ 

Z  p:Aj 

V  7=0  7 


(5.14) 


subject  to  the  constraints 


Lp- 1 


Z  PkJal=0,y  k^i, 

7=0 
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and 


T.PtM-JTc)  =1’  Vk=l,2,...,K. 

j= o 

The  latter  constraint  is  to  avoid  transmit  power  increase  due  to  the  prefilter.  (||  •  ||  is  the 
standard  2-norm.)  Define  matrices 


*1,0 

*1.1  ' 

*1,  Lp-l 

i 

*2,0 

i 

*2,1 

"  a‘2’Lp-'  /=12 

:,K. 

(5.15) 

i 

aK,  0 

i 

aK,  1 

i 

aK,Lp-\ 

For  user  i,  the  solution  for  vector  p,  which  satisfies  (5.14)  as  well  as  the  constraints  can  be 
found  by  solving  the  equation  about  an  Z./;- tuple  vector  p,  =  [pj  0 ,  /), , ,...,  pt  Lp  ,  ]T 

A,  x  p,  =  e, ,  (5.16) 

where  vector  e,  is  an  all-zero  vector  except  for  a  unity  entry  at  position  i.  If  A,  is  a  consistent 
matrix,  an  exact  zero-forcing  solution  can  be  obtained;  otherwise,  a  lease  square  solution  can 
be  obtained  by  minimizing  ||  e,-A,x  p.  ||.  The  prefilter  coefficients  p,  are  obtained  by 

nonnalizing  p, ,  p,-  =  5/Jp . ,  in  which  the  power  scaling  factor  Sft  is  given  by 

5/|  = - - - ,  i  =  l,2,...,  K.  (5.17) 

Lp- 1 

'ZhM-jV 

j= 0 

It  is  noted  that  each  user  prefilter  is  normalized  individually,  which  allows  each  user  to 
achieve  a  specified  received  symbol  energy-to-noise  ratio,  instead  of  using  a  global  transmit 
signal  normalization  as  in  the  linear  decorrelating  precoding  scheme. 
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By  the  prefilter  optimization,  the  symbol  detector  input  in  the  /th  user  receiver  is 


Sfbj  +  Re [/;,]-  >  =  1,2 therefore,  the  BER  for  user  i  is  given  by 


re,  = 


(5.18) 


An  important  advantage  of  the  linear  decorrelating  prefilter  is  that  it  effectively 
applies  the  preprocessing  matrix  to  the  output  of  the  spread  spectrum  encoder,  so  it  can  be 
applied  externally  to  an  existing  design.  However,  as  shown  by  the  simulation  results  in 
[BDOO]  and  our  simulation  in  section  5.5,  this  method  does  not  have  performance  benefit 
compared  to  the  linear  precoder  in  [VJ98], 
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5.3  PreRAKE  Multiuser  Precoding  (Pre-RDD) 

The  Pre-RDD  [Gun03]  was  inspired  from  the  Rx-based  RDD  described  in  section 
2.2.1.  In  the  transmitter  of  Pre-RDD,  a  linear  decorrelating  filter  is  followed  by  a  Pre-RAKE 
combiner.  Suppose  the  linear  filter  is  represented  by  a  KxK  matrix  G.  The  output  vector  of 
the  filter  is  thus  given  by  w  =  [wl,w2,...,wK  ]T  =  Gb  .  The  result  of  Pre-RAKE  combining  is 
nonnalized  by  a  global  power  scaling  factor  Sf .  Therefore,  the  ultimate  transmit  signal  from 
BS  is  given  by 

x(t)  =  -(JV-1- n)Tc)  (5.19) 

k= 1  12=0 

At  the  mobile  user  receivers,  simple  matched  filters  are  used.  For  all  K  users,  the  MF 
output  vector  y  =\yx,y2,...,yKY  can  be  expressed  in  a  compact  form  y  =  SECRC^Gb  +  n . 

Define  the  correlation  matrix  R  =  CRCW  and  the  precoding  filter  G  =  S  R  =  (CRC;/ )  1 . 
Then  the  MF  output  is  simplified  into  y  =  S  f  S  'b  +  n  ,  where  the  diagonal  matrix  of  channel 

gain  nonnalization  factors  S  follows  the  definition  in  section  4.1.  Equivalently,  the  MF 
output  for  the  7th  user  is 

V,  =  +"i  (5-2°) 

A 

To  normalize  the  total  transmit  power,  the  average  transmitted  signal  energy  per  symbol 
Eb{j|*(0r</]  should  be  equal  to  the  original  average  transmit  energy  Eb  fb 7 b ] .  That  is,  we 
need  to  satisfy 
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(5.21) 


■s/E4![g"crc"g1,  =i>,2. 

i= 1  i=l 

Therefore,  the  global  power  scaling  factor  is  given  by 


K 


i=l 


By  equation  (5.20),  the  BER  of  the  kth  user  in  BPSK  systems  is 


(5.22) 


(5.23) 


Although  the  Pre-RDD  has  the  same  decorrelating  filter  and  RAKE  combining 
structure  as  the  Rx-based  RDD,  their  theoretical  BER  are  different.  From  equation  (5.23),  we 
observe  that  if  the  transmit  power  for  all  users  are  equal,  for  Pre-RDD  all  users  have  identical 
error  rate.  This  is  because  all  users  share  the  same  power  scaling  factor.  As  derived  in  section 
2.2.1,  for  RDD,  different  users  can  have  different  error  rates. 
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5.4  Linear  Decorrelating  Multiuser  Precoding  Combined  with  Transmit  Antenna 
Diversity 

5.4.1  Transmitter-Based  Antenna  Diversity  and  Multi-Input-Single-Output  (MISO) 
Channel  model 

For  the  downlink  of  mobile  radio  systems,  transmitter-based  diversity  techniques 
enable  to  shift  signal  processing  from  MS  to  BS,  where  power  and  computational  complexity 
are  more  abundant,  thus  the  mobile  user  receiver  units  are  simplified.  Transmit  antenna 
diversity  at  the  BS  has  been  identified  as  an  efficient  way  of  improving  the  downlink  data 
rate  without  expanding  the  bandwidth,  especially  in  low-mobility  environments,  where  there 
is  insufficient  time  or  frequency  diversity  [Ala98,  Lo99,  BZP04].  Indeed,  transmit  diversity 
has  already  been  adopted  by  the  3GPP  standards  [TSG01]. 

Closed-loop  transmit  antenna  diversity  techniques  achieve  better  performance  than 
open-loop  methods,  but  they  require  feedback  of  the  channel  state  information  from  the  MS, 
as  shown  in  Fig.  5.1.  However,  in  practical  rapidly  varying  fading  channels,  the  fed  back  CSI 
is  not  up-to-date  which  results  in  performance  degradation.  The  LRP  algorithm  has  superior 
performance  relative  to  conventional  channel  prediction  methods,  because  of  its  much  longer 
memory  span  obtained  by  using  lower  sampling  rate  [DHOO],  The  performance  of  transmit 
antenna  diversity  techniques  aided  by  LRP  are  analyzed  in  [GD05]. 

The  hybrid  designs  which  combines  transmitter-based  MAI  cancellation  with  transmit 
antenna  diversity  are  described  in  this  chapter.  We  only  elaborate  the  combination  of  linear 
precoding  with  multiple  transmit  antennas;  however,  these  space-time  designs  can  be  easily 
extended  to  nonlinear  precoding. 


76 


Figure  5.1  System  Diagram  of  Closed-loop  Transmit  Antenna  Diverisy 


Fig.  5.1  shows  the  MISO  channel  diagram.  Consider  a  downlink  DS/CDMA  system 
with  K  active  mobile  users  in  a  cell.  The  BS  employs  L  transmit  antennas,  and  the  MS  for 
each  user  employs  single  receive  antenna.  The  channels  from  the  antennas  at  the  BS  to  the 
MSs  are  each  subject  to  frequency-selective  Rayleigh  fading  with  N  resolvable  paths.  If  the 
baseband-equivalent  signal  transmitted  from  the  /th  antenna  in  the  data  interval  of  interest  is 
denoted  by  x(l\t),  /  =  1,2,  ...,  L,  then  the  signal  received  by  the  z'th  user  can  be  expressed  as 

N- 1  L 

w)  =  ZZc<y>  (t  -  nT  )  +  n,(t) ,  where  Tc  is  the  chip  duration,  cf]  is  the  channel  gain 

n= 0  l=\ 

coefficient  of  the  /7th  path  from  the  /th  transmit  antenna  to  the  z'th  user’s  receiver  and  zz,(7)  is 
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the  complex  AWGN  with  power  spectral  density  No,  for  V  z  =  1,2,  ...,K  and 
n  =  0,  1,  1.  The  channel  coefficients  c\l]  are  modeled  as  independent  and  identically 


N- 1 


(0 


distributed  (i.i.d)  Rayleigh  fading  random  variables  with  E{  X  |  c,-,„  |  }=  1.  (E{-}  is  the 


«=0 


expectation.)  With  the  previously  defined  notations  for  user  data  and  spreading  codes,  when 
no  Tx-based  processing  is  applied,  the  signal  received  by  the  z'th  user  is  given  by 

ri (0  =  Tj Z Z ciJ>bksk  if  ~  nTc )  +  ni  (0 .  (5 .24) 

n= 0  /= 1  k= 1 


For  the  convenience  of  derivation,  define  the  channel  gain  row  vector  for  the  z'th  user 

corresponding  to  the  /th  transmit  antenna  as  cf  =  [c%,  and  for  all  K  users, 

/a  (4  (0  W 

define  the  AT-row  (Wx/Q-column  channel  gain  matrix  Cu  =  diag{c  i ,  C2,...,Ck  }, 
V/=  1,  2 To  avoid  the  transmit  power  increase  incurred  by  channel  gain  coefficients, 
we  use  the  normalized  values  of  channel  gain  coefficients  in  precoding.  For  the  z'th  user,  the 
channel  gain  nonnalization  factor  is  defined  as 


■W  = 


f  L  N-\ 


-1/2 


c(Z) 

i,n 


(5.25) 


V/= 1  n= 0  J 

Then  the  nonnalized  channel  gain  coefficient  is  cfl  =  Scic\1].  Define  the  nonnalized 
vectorc,(/)  =Sci  •  c/Z) and  matrix  C(/)  =  z/z'ag{c,</),c2(/),...,cA:(/)}, respectively.  By  representing 
the  nonnalization  factors  with  a  diagonalifxif  matrix  Sc=diag{Sci,SC2,...,ScK},  we  obtain 
the  relation  C(/)  =  . 
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5.4.2  Zero  Forcing  (ZF)-based  PreRAKE  Linear  Decorrelating  Precoding 


(PreRAKELDP) 


Figure  5.2  Transmitter  diagram  of  PreRAKELDP  for  a  2-user,  Z-antenna  System 


The  transmitter  diagram  of  the  PreRAKELDP  with  multiple  Tx  antennas  is  shown  in 
Fig.  5.2.  The  linear  MAI  pre-cancellation  process  actually  increases  the  required  transmit 
power.  To  offset  the  power  increase,  we  scale  the  amplitude  of  the  transmit  signal  b,  by  a 
factor  Sfh  V  i  =  1, 2,  ...,  K.  For  all  users,  define  the  diagonal  scaling  factor  matrix 
Sf  =  diag{S/i,  S/2,  ...,  S/k}.  Suppose  the  decorrelating  (ZF)  filter  is  a  KxK  matrix  G,  and  the 
result  of  power  scaling  and  decorrelating  is  denoted  by  vector  w. 
Then  w  =  [iv1,w2,...,wA  ]r  =GSfb.  The  transmitted  signal  from  the  /th  antenna,  x  l](t),  is 
generated  by  passing  the  decorrelating  filter  output  through  the  /th  branch  of  the  preRAKE 
combiner 
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(5.26) 


,<o 


K  N-\ 


(0  =  X  X  W*^S  0  -  (tf  - 1  -  n)Tc )  . 
&=1  «=0 


At  the  zth  user  receiver,  the  equivalent  baseband  received  signal  is  given  by 


r/(o = ZZ<>(,)('  -ptc)+ nV)- 

p= o  1=1 


(5.27) 


A  channel-gain-independent  matched  filter  (MF)  is  used  at  the  front  end  of  each  user’s 
receiver.  To  obtain  the  largest  received  signal  energy,  the  output  of  the  matched  filters  should 
be  sampled  at  the  moment  t  =  ( N-\)TC  [ESN99].  The  output  of  the  zth  user’s  MF  is  given  by 

fT+(N-V)Tc 


y,  =  f  l)7, ' 1  n  (Os,  (t-(N  -  i )tc  )dt . 


(5.28) 


Substituting  (5.26)  and  (5.27)  into  (5.28),  we  obtain 


K  N-\N-l-n  L 


yt  =  X  Z  X  ZcS+»3K  wkKk  +ni> 


(5.29) 


k= 1  ;;=0  m=—n  l=\ 


where  n,  is  the  filtered  noise  component  with  power  E\nini\=N0.  Define  vectors 


y  =  [vi,  V2,  . . . ,  v^] 7  and  n  =  [m,  in,  iik\.  Then  the  vector  of  MF  outputs  of  all  K  users 


(  L 


y  =  sc 


2]C(/,RC(/|  GSfb  +  n . 

V  z=i  ) 


(5.30) 


Let 


R  =  ^C( ’RC 


w 


(5.31) 


/= 1 


To  recover  the  desired  user  signal,  define  the  decorrelating  precoding  filter 


G  =  R1. 


(5.32) 


Then  equation  (5.30)  can  be  simplified  as 


y  =  ScSfb  +  n. 


(5.33) 
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It  is  obvious  that  the  MF  output  for  each  user,  y.  =  (Sfi  / Sci  )bt  +  ni ,  does  not  contain  MAI. 

Now  we  detennine  the  values  of  the  transmit  power  scaling  factors.  The  total  average 
energy  of  the  transmit  signal  during  one  symbol  interval  should  satisfy 


E 


b 


=  £b{brb}  ,  where  Eb{-}  is  the  expected  value  with  respect  to  the  data 


L  T 


symbols.  From  equation  (5.26),  Eb  SJ  |*(/)(0| 


1=1  0 


dt 


=fs’b{b7G//Sf2b};  hence,  it  is  sufficient  to 


require  user  i  to  satisfy  GuSfi  =  1 ,  /  =  1,  2,  ...,  K.  From  (5.31)  and  (5.32),  we  find  that  the 
diagonal  elements  of  G  are  real.  Therefore,  the  power  scaling  factor  for  user  i  is 

1 


9  2  =  — 

Sfi  Gti 


[R-'Lv ' 


(5.34) 


Consequently,  for  user  i,  the  BER  of  the  PreRAKELDP  with  individual  power  scaling  in 
tenns  of  the  average  transmit  signal-to-noise  ratio  (SNR)  y/„  is 


PeXrJ  =  Q 


N-\  L 


2ZZd 


(0 


n= 0  /= 1 


[R1],, 


-Yu 


(5.35) 


V  J 

where  ;//„  =  E/JNo  =  AG(2No).  Comparing  equation  (5.35)  with  the  BER  formula  of  RDD 
[HS94],  we  find  that  in  the  absence  of  antenna  diversity,  i.e.,  when  L=  1,  the  perfonnance  of 
the  PreRAKELDP  with  power  scaling  is  identical  to  that  of  RDD.  The  Pre-RDD  [Gun03]  is 
equivalent  to  the  precoder  derived  in  this  section  for  a  single  antenna,  but  a  global  power 
scaling  factor  instead  of  individual  power  scaling  factors  is  utilized.  As  a  result,  all  users 
have  the  same  BER  given  by  the  average  of  all  users’  BERs  of  the  PreRAKELDP  method.  In 
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practice,  the  PreRAKELDP  is  more  flexible  because  it  allows  different  users  to  satisfy  their 
individual  reliability  criteria. 

Comparing  the  Pre-RDD,  ZF-based  PreRAKELDP  and  the  linear  decorrelating 
precoding  with  RAKE  receiver  in  [VJ98],  we  can  find  that  all  the  three  methods  utilize 
RAKE  multipath  diversity  technique;  the  only  difference  is  that  the  method  in  [VJ98]  uses 
RAKE  combining  in  receiver,  while  the  other  two  methods  use  it  in  transmitter.  However, 
this  different  in  structure  results  in  the  performance  difference  between  the  method  in  [VJ98] 
and  the  two  PreRAKE  precoding  methods.  The  detailed  performance  analysis  for  a  2-user 
system  is  provided  in  the  Appendix.  More  comparison  results  are  obtained  by  simulation 
examples  in  section  5.5. 

5.4.3  Minimum  Mean  Square  Error  (MMSE)-based  PreRAKELDP 

From  (5.35),  we  observe  that  the  power  scaling  factors  degrade  precoding 
performance.  With  the  same  transmitter  structure  as  in  Fig.l,  in  this  section  our  goal  is  to 
design  the  optimum  linear  precoding  filter  G  in  the  sense  of  MMSE  criterion  under  the  total 
average  transmit  power  constraint.  Similarly  to  equation  (5.30),  with  linear  precoding,  the 
sampled  MF  output  vector  at  the  receiver  is  given  by 

Jl  (/)  -  (ip 

y=IC  RC  Gb  +  n.  (5.36) 

/=  i 

The  filter  G  should  be  the  solution  of 

min  ZTb{||b-Scy||2} 

GsCKxK 

subject  to 


82 


(5.37) 


For  the  convenience  of  following  derivations,  define  the  amplitude  matrix  for  all  users  as 


Am  =  diag{'\[Ebi,  yJWb ~2,  ...,  -\j EbK }.  It  is  easily  known  that  L  J  fx2(t)dt  l  =  tr  jG^RGA,,,2!  .  By 


means  of  Lagrange  multiplier  method,  it  is  desired  to  find  G  which  minimizes 


£  =  £b{||b-Scy||2}+A  tr {GHRGAm2} . 


(5.38) 


The  derivative  of  the  first  term  with  respect  to  each  element  of  G  is 


^T-£b{||b-Scy||2}  =  ^£b{(b7G7R+n7’Sc)(RGb+Scn) 


2  [R2GAm2]y— 2  [RAm2]  y 


(5.39) 


The  derivative  of  second  tenn  w.r.t.  each  element  of  G  is 


8U-triGHRGA  2) 


d  K  K  - 
/ IE l)j  ^  Gmj  ^  Rm,nGnj 

0(jij  m= 1  n=  1 


=  2 AEbj  I  RUnG„j  =  2/[RGAni2],- 

n= 1 


(5.40) 


Since  -  0,  we  get 


[R2GAm2](V-[RAm2](-/  =  /[RGA,,,2], 


(5.41) 


In  terms  of  matrices,  (5.41)  is  equivalent  to 


R2GA  2-RA  2  =  /LRGA  2 


G  =  (R+  Ilf 


(5.42) 
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where  the  value  of  X  can  be  detennined  from  the  constraint  (5.37),  which  is  equivalent  to 

tr{G  RGAm~}  =  tr{Am  }.  Note  that  R  is  positive  definite.  Suppose  its  eigenvalues  are  a\, 
a.2,...,  ok •  Define  the  unitary  matrix  F=  { Fy) KxK  with  /th  column  given  by  the  eigenvector  of 

R  corresponding  to  aj,  j=\,2,...,K.  Therefore,  R  =  Fdiag{aj}F"  and 
G  =  Fdiag{(aj+Xy  }  FH,  where  diag{ aj}  represents  the  K*K  diagonal  matrix  with  aj  at  the  /th 
diagonal  position.  The  total  power  constraint  can  be  expressed  as 

=  X^2.  (5.43) 

i=\ 

By  solving  this  equation  we  can  obtain  the  value  of  X. 

The  total  power  constraint  optimization  for  single-path  AWGN  channel  was  studied 
in  [VJ98]  and  [HKT02].  It  was  shown  that  for  the  single-path  channel  with  highly-correlated 
signature  sequences,  the  total  power  constraint  optimization  does  not  have  the  BER 
performance  advantage  over  the  transmit  power  scaling  method,  due  to  the  severe  residual 
MAI  in  the  received  signals.  However,  we  consider  a  different  channel  model  in  this  paper 
corresponding  to  the  orthogonal  downlink  CDMA  over  multipath  fading  channels.  For  this 
model,  the  residual  MAI  is  very  small  for  the  total  power  constraint  approach.  Therefore,  the 
total  power  constraint  optimization  results  in  better  performance  than  the  individual  power 
scaling  for  the  PreRAKELDP.  This  conclusion  will  be  verified  by  simulation  results.  On  the 
other  hand,  optimization  under  the  total  power  constraint  has  much  higher  complexity  than 
the  transmit  power  scaling. 

It  should  be  noted  that  to  achieve  a  simple  and  practical  transmitter  design  for  a 
system  with  large  number  of  users,  the  allocation  of  individual  user  transmit  powers  is  not 


K 

X 

1=1 


,2j.  I Ftfa, 
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taken  into  account  in  the  PreRAKELDP  designs.  Since  the  pre-RAKE  combiner  is  equivalent 
to  a  matched  filter  which  is  matched  to  the  multipath  fading  channels,  the  PreRAKELDP 
with  individual  user  power  scaling  and  that  with  total  transmit  power  constraint  are  the 
optimum  linear  ZF  and  MMSE  precoders,  respectively. 

5.4.4  Multipath  Decorrelating  Precoding  (MDP) 

In  the  PreRAKELDP,  the  calculation  of  precoding  filter  coefficients  requires  the 
operation  of  matrix  inversion  of  a  CSI-dependent  matrix,  which  results  in  high  computational 
complexity.  To  simplify  the  transmitter,  we  present  an  alternative  precoding  algorithm, 
MDP.  As  in  the  PreRAKELDP,  in  the  MDP  design  each  MS  only  needs  to  use  a  simple  and 
CSI-independent  matched  filter  for  data  detection. 

As  shown  in  Fig.  5.3,  for  a  system  with  L  transmit  antennas,  the  precoding  process 
consists  of  L  parallel  and  independent  branches.  Fig.  5.4  demonstrates  the  detailed  structure 
of  each  branch.  In  the  /th  branch,  the  data  symbols  for  the  K  users  are  first  weighed  by  the 

nonnalized  channel  gains,  with  the  output  given  by  the  (Kx/V)-element  vector  C  1  ^  b  .  As 
shown  in  Fig.  5.4,  the  channel  gain  weighed  signals  are  filtered  by  the  power  control  filter 
and  decorrelating  filter.  The  decorrelating  filter  is  defined  as  G  =  R  '.  The  power  control 
filter  for  the/th  user  is  defined  as  T  =  St  ([G]„)  1  ([G]y  represents  the  (7,  /)th  NxN  block  of 

G);  for  all  K  users,  define  the  compact  form  as  T=diag{T\,T2,. .  .,Ta:} ,  which  is  a  (KxN)- row 
(Ax/V)-column  matrix  with  T,  as  the  /th  diagonal  NxN  block,  /  =  1,  2,  ..., K.  Note  that  the 
notation  for  the  power  control  filters  and  decorrelating  filter  are  not  identified  with  an 
antenna  index,  which  means  these  filters  have  exactly  the  same  definitions  for  all  the  L 
branches.  The  output  of  the  decorrelating  filter  for  the  /th  branch  is  given  by 


85 


w(/)  =  =GTC^‘b  .  Following  the  signature  sequence  spreading,  the 


transmitted  signal  from  the  /th  antenna  is 


’  (0  =  ZE  ^  )• 


(5.44) 


It  is  sufficient  to  require  the  individual  users  to  satisfy 


i=  1,2,  . . .,  K.  It  can  be  calculated  that  St  is 


s,=  Zc«>([G]„)-'c! 


(5.45) 


At  the  /th  user’s  receiver,  the  baseband-equivalent  received  signal  is  given  by 


ri  (t)  =  YJYjCi')px0)  (t  -  pTc)  +  n^t) . 


(5.46) 


With  the  same  MF  as  in  the  PreRAKELDP,  the  MF  result  for  the  /th  user  is 


K  N-lN-l  L 


Vi  = 

k=\  n= 0  p=0  1= 1 


N -\-(n+ p) 


(5.47) 


where  nt  is  the  filtered  noise  component  with  power E{nin*}=  N0 .  For  all  K  users,  the 
compact  form  of  the  MF  output  is  given  by 


y  =  Sc_1  ^C(/)TC(/)//  b  +  n 


(5.48) 


Therefore,  the  simplified  result  of  v*  is 


y  =  Scf’dR-'U-'c,1'1''  \bi+nr 


(5.49) 
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Note  that  the  MAI  is  completely  cancelled  by  transmitter  precoding.  The  BER  in  terms  of  ybi 
is  given  by 


Pe^Yin)  =  Q 


SZcf’dR-'],,)- 


V'1 


Ybi 


(5.50) 


The  BER  of  single-antenna  MDP  is  identical  to  that  of  the  Rx-based  MDD  [ZB96,  Zvo96], 
The  MDD  is  a  suboptimal  linear  decorrelating  MUD.  It  is  shown  in  [HS94]  that  the  RDD 
outperforms  the  MDD.  Since  the  PreRAKELDP  with  individual  power  scaling  has  identical 
performance  to  the  RDD,  it  has  better  performance  than  the  MDD  and  MDP.  However,  as  the 
precoding  process  in  MDP  does  not  involve  the  inverse  operation  of  CSI-related  matrix,  the 
MDP  is  computationally  simpler  than  the  PreRAKELDP  and  the  RDD. 

The  STPR  MUP  in  [GDv03]  employs  the  same  decorrelating  filter  R'1  as  the  MDP 
does;  thus  it  has  similar  complexity  to  the  MDP.  For  the  simple  case  of  equal  transmit  power 
for  all  users  (i.e.,  A\  =  A2  =...  =  Ak),  the  following  derivation  shows  that  the  MDP  has  lower 
BER  than  the  STPR  MUP.  Without  losses  of  generality,  we  only  illustrate  the  case  of  single 
transmit  antenna.  Using  the  notations  defined  above,  the  BER  for  the  STPR  MUP  is  given  by 

f  \ 


pe,(rhl)  =  Q 


2K 


-Yu 


k= 1 


(5.51) 


Considering c,  =  Sa  c,  and  from  equations  (5.50)  and  (5.51),  it  is  known  that  comparing  the 
performance  of  these  two  methods  is  equivalent  to  comparing  the  decision  statistics  for  the 

f K  V1 

MDP,  X,  sc,.([R- and  that  for  STPR  MUP,  Y  =  K  £c*[R-I]ttct"  .Although 

V  k= 1  J 


87 


direct  comparison  between  Xt  and  Y  does  not  lead  to  definite  conclusion  due  to  the  fact  that 
the  comparison  result  is  influenced  by  the  instantaneous  channel  gain  coefficients  and 
signature  sequences  assignment  among  users,  it  is  still  inspiring  to  analyze  the  BER  averaged 

over  all  users  for  these  two  methods  by  means  of  comparing  X,  and  Y  =(c,LR  l]„c//)  , 


V  /=  1,2, 

From  the  definition  of  matrix  R,  we  know  it  is  positive  definite;  thus  R"1,  its  NxN 
principle  submatrix  [R  and  ([R  -%r  are  also  positive  definite  [HJ85],  i  =  1,2, 

Matrix  [R  ’]  a  can  be  factored  into  UAU^,  where  U  is  an  NxN  unitary  matrix  and  A  is  a 
diagonal  matrix  with  the  eigenvalues  of  [R"']„,  J3\,  f:h,  ...,  J3N,  on  its  diagonal.  Notice  that  A,, 
Yj  and  all  eigenvalues  of  [R  ]..  are  positive  real  numbers,  V  i=  1,2,  . . .,  K.  The  ratio  of  Xt  to 
Yj  is  given  by 


Xt/Yt  =c,.([R-1],)-1c,.//c,.[R-1],c1.//  =  c,UA-IU"c  "c,UAU 


HcH. 


(5.52) 


Denote  the  product  of  c,U  as  u  =  \ul,u2,...,uN],  Therefore, 


(  N  -  \ 

(  2k 

N  U 

Xi/Yi  =  uN'u'uAu'^ 

f  ZklV, 

k  «=i  J 

y  j 

12  V 


V  /= 1  J 


N  N 

+ y ^\u„um 

|  n  m 

m= 1 n= 1 
n^m 


^ Hvl  +  JL-2 

V  fin  Pm  ) 


(5.53) 


In  this  result,  the  first  term  is  equal  to  uuH  =  c( UU " c;"  =  1  and  the  second  tenn  is  non¬ 
negative;  thus  Xi/Yi  >  1 .  Therefore,  it  is  proved  that  Xt  >  Yt .  This  analysis  indicates  that  the 

average  performance  of  MDP  is  better  than  that  of  the  STPR  MUP.  More  accurate 
comparison  among  linear  precoders  is  given  by  simulation  experiments. 
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Figure  5.3  Transmitter  Diagram  of  MDP  for  a  2-user,  T-antenna,  iV-channel  paths/user 
system 


Figure  5.4  The  Structure  of  the  /th  branch  in  the  MDP  transmitter  for  a  2-user,  2- 
path/user  system 
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5.5  Numerical  Results  and  Analysis 

In  this  section,  we  observe  the  performance  of  linear  precoding  techniques  by  a  few 
numerical  experiments.  For  all  simulation  examples,  BPSK  modulation,  orthogonal  signature 
sequence  spreading  and  equal  transmit  power  for  all  users  are  assumed.  There  are  three 
resolvable  channel  paths  from  each  Tx  antenna  at  BS  to  each  user’s  Rx  antenna.  In  the 
first  example,  consider  a  4-user  uncoded  system  with  single  transmit  antenna.  Fig.  5.5  shows 
the  BER  averaged  over  all  users  for  the  ZF -based  approaches:  PreRAKELDP  with  individual 
power  scaling  (labeled  as“PreRAKELDP_ps”),  MDP,  linear  precoding  with  RAKE  receiver 
(labeled  as  “Prec.  With  RAKE”)  [VJ98],  linear  decorrelating  prefilters  (labeled  as 
“Prefilters”)  [BDOO],  STPR  MUP  [GDv03],  Rx-based  RDD  [HS94]  and  MDD  [ZB96].  As 
analyzed  before,  the  proposed  PreRAKELDP_ps  and  MDP  have  identical  BER  to  RDD  and 
MDD,  respectively.  Observe  that  the  two  new  precoders  significantly  outperform  other 
methods.  The  MDP  has  modestly  higher  BER  than  the  PreRAKELDP_ps. 

Next,  we  compare  the  performance  of  the  three  ZF  precoders,  PreRAKELDP_ps, 
MDP  and  STPR  MUP,  with  multiple  Tx  antennas.  An  8-user  uncoded  system  is  considered. 
Fig.  5.6  shows  the  average  BER  of  these  methods  with  one  and  three  transmit  antennas, 
respectively.  Obviously,  the  antenna  diversity  results  in  significant  performance 
improvement.  Fig.  5.7  shows  the  BER  averaged  over  all  users  versus  the  number  of  users. 
The  transmit  SNR  is  fixed  at  OdB.  Observe  that  the  BER  saturates  as  the  number  of  users 
becomes  greater  than  eight.  This  observation  is  consistent  with  the  result  in  [Ver98,  p255] 
that  for  decorrelating  (ZF)  MAI  cancellation  methods,  the  BER  converges  as  the  number  of 
users  K—>  go. 
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In  Fig.  5.8,  we  compare  the  perfonnance  of  the  PreRAKELDP  with  individual  power 
scaling  (PreRAKELDP_ps)  and  with  total  power  constraint  (PreRAKELDP_pc).  An  8-user 
single-transmit  antenna  system  is  considered.  Both  coded  and  uncoded  cases  are  simulated. 
For  the  coded  system,  we  employ  the  rate  Vi  convolutional  code  with  generator  vectors  753 
and  561,  based  on  WCDMA  standard  [ICM98].  At  the  receiver  end,  the  soft  decision 
decoding  is  implemented  by  the  standard  Viterbi  decoder  at  the  MF  output.  It  is  shown  that 
for  both  coded  and  uncoded  cases,  the  PreRAKELDP_pc  has  lower  BER  than 
PreRAKELDP_ps . 
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BER 


Transmit  Eb/Nfl  (dB) 


Figure  5.5  Performance  comparison  of  linear  precoding  techniques  in  multipath  fading 
channels,  4  users  with  equal  transmit  powers,  3  channel-paths/user,  single 
transmit  antenna. 
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Transmit  Eb/Nfl  (dB) 

Figure  5.6  Performance  comparison  of  three  space-time  precoding  methods,  8  users 
with  equal  transmit  powers,  3  channel-paths/user. 
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BER 


Number  of  Users 


Figure  5.7  Performance  comparison  of  three  space-time  precoding  methods,  1  to  12 
users  with  equal  transmit  powers,  3  channel-paths/user,  transmit 
Eb/N0=OdB. 
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BER 


Transmit  Eb/NQ  (dB) 

Figure  5.8  Performance  comparison  of  individual  power  scaling  and  total  power 
constraint  for  the  PreRAKELDP,  8  users  with  equal  transmit  powers,  3 
channel-paths/user,  single  antenna. 
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Chapter  6 


CONCLUSIONS 

This  thesis  has  concentrated  on  multiuser  precoding  and  transmitter  diversity  for  the 
downlink  of  DS-CDMA  systems.  We  mainly  focused  on  the  nonlinear  THP  and  linear 
decorrelating  precoding  algorithms,  transmitter-based  multipath  diversity  and  antenna 
diversity.  These  techniques  significantly  increase  the  downlink  capacity  while  keeping  low 
complexity  of  mobile  user  receivers. 

For  the  purpose  of  analyzing  the  duality  between  the  Tx-based  and  Rx-based  MAI 
cancellation  methods,  we  first  introduced  the  Rx-based  linear  and  nonlinear  decorrelating 
multiuser  detection  techniques.  The  proposed  Tx-based  THP  algorithm  uses  the  same  MAI 
cancellation  principle  as  the  Rx-based  DF  multiuser  detector.  In  particular,  for  frequency- 
selective  fading  channels,  the  PreRAKETHP  and  MDTHP  have  the  same  theoretical 
performance  as  the  RDDFR  and  MDDFR,  respectively.  However,  the  Tx-based  methods 
avoid  the  error  propagation  problem;  therefore,  the  actual  performance  of  THP  designs  is 
better  than  that  of  the  DF  multiuser  detectors.  In  addition  to  the  Tx-based  THP  design  applied 
to  downlink  CDMA  channels,  we  also  presented  the  Tx-Rx  joint  THP  optimization  algorithm 
for  MIMO  channels.  In  this  derivative  design,  the  feed-forward  filter  is  shifted  to  the 
receiver;  as  a  result,  the  transmitter  and  receiver  have  more  balanced  complexity. 

By  theoretical  analysis  and  simulation  experiments,  it  is  shown  that  the  THP 
significantly  outperforms  the  linear  decorrelating  precoding  schemes  for  M- ary  PAM  and 
QAM  modulated  systems  when  M  is  large.  However,  for  small  M,  due  to  the  end  effect  and 
power  penalty  of  modulo  operation,  the  performance  advantage  of  THP  over  the  linear 
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methods  diminishes.  Therefore,  the  linear  precoding  techniques  are  more  suitable  for  small- 
constellation  modulated  systems,  such  as  BPSK  and  QPSK.  For  multipath  fading  channels, 
we  proposed  the  linear  precoding  techniques,  PreRAKELDP  and  MDP.  From  the  view  of 
structure  and  performance,  The  Tx-based  PreRAKELDP  and  MDP  for  the  downlink  are 
equivalent  to  the  Rx-based  RDD  and  MDD  for  the  uplink,  respectively.  The  analytical  and 
numerical  results  show  that  the  PreRAKELDP  and  MDP  have  better  performance  than  the 
previously  proposed  linear  precoding  methods  with  similar  complexity,  including  the  linear 
decorrelating  precoding  with  RAKE  receiver,  the  linear  decorrelating  prefilter  and  the  space- 
time  Pre-RAKE  multiuser  precoding. 

To  obtain  the  benefit  of  multipath  diversity,  all  the  proposed  linear  and  nonlinear 
precoding  schemes  utilize  the  Tx-based  multipath  combining.  The  Pre-RAKE  multipath 
combiner,  which  is  used  in  the  PreRAKETHP  and  PreRAKELDP,  is  perfectly  matched  to  the 
multipath  fading  channels.  Thus  it  can  collect  the  signal  energy  to  the  maximum  extent.  The 
PreRAKETHP  and  PreRAKELDP  are  the  optimal  THP  and  linear  precoders,  respectively,  by 
the  ZF  and  MMSE  criteria.  However,  the  performance  gain  for  these  two  methods  is 
achieved  at  the  expense  of  high  computational  complexity,  since  the  precoding  filters  are 
CSI-dependent  and  therefore  have  to  be  updated  frequently.  Both  the  nonlinear  MDTHP  and 
linear  MDP  use  suboptimal  multipath  combining  method.  Although  their  perfonnance  is 
worse  than  the  PreRAKETHP  and  PreRAKELDP,  respectively,  they  have  lower 
computational  complexity  than  the  PreRAKE-based  precoders,  because  their  precoding 
filters  are  independent  of  CSI  and  only  determined  by  the  users’  spreading  codes.  In 
summary,  there  is  a  performance-complexity  tradeoff  between  the  nonlinear  PreRAKETHP 
and  MDTHP,  and  between  the  linear  PreRAKELDP  and  MDP.  We  also  investigated  the 
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combination  of  multiuser  precoding  and  multiple  transmit  antennas.  It  is  shown  that  the 
employment  of  multiple  transmit  antennas  significantly  improves  the  system  performance, 
especially  when  the  transmit  power  is  large. 

A  critical  requirement  for  implementation  of  the  proposed  transmitter-based 
techniques  is  the  knowledge  of  CSI  at  the  transmitter,  which  can  be  satisfied  by  the  long 
range  channel  prediction  (LRP).  It  is  shown  that  for  rapidly  time  varying  fading  channels,  the 
multiuser  precoding  aided  by  LRP  achieves  similar  performance  to  the  case  that  the  CSI  is 
perfectly  known  at  the  transmitter. 
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APPENDIX 


Performance  Comparison  between  Linear  Precoding  with  RAKE  Receiver  and  Linear 
PreRAKE  Precoding 

Consider  a  2-user,  2-channel  paths/user  system  (K=  2,  N=  1).  Suppose  the  transmit 
power  for  the  two  users  are  equal,  i.e.,  A1  =  A2 ,  and  the  channel  gain  coefficients  along 
different  paths  are  i.i.d  random  processes,  and  for  every  user  the  total  average  power  of 
channel  gains  are  normalized  to  one.  From  equation  (5.10),  the  BER  for  the  method  of  linear 
decorrelating  precoding  with  RAKE  receiver  [VJ98]  (named  “Lin.  RAKE”  for  short)  can  be 

written  as  Pej  2y~X~),  where  the  decision  statistic  X,  =  - .Since  the 


ZI* 


l—n  , 


M)  n= 0 


signature  sequences  are  nonnalized,  R-\  =  1 ,  Vi  =  1,2...,  K.  By  the  definition  of  power 
scaling  factor  Sf  in  equation  (5.8),  A)  in  the  considered  2-user  system  is  given  by 
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Given  the  definition  of  correlating  matrix  R,  as  in  section  5.1,  and  R''  =  0,  Vi  ^  j,  due  to 


the  orthogonality  of  signature  sequences,  the  correlation  matrix  for  the  Lin.  RAKE  method  is 
simplified  as 
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Denote  R  = 

be  re-written  as 

X,.  = 


^11  ^12 


_ _ I  12  2  2  2 

Rc  Rf  |=  an  +a22  +al2  +  a2l  .  Equation  (A.l)  can 
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(A. 3) 


Represent  the  BER  formula  for  the  Pre-RDD  as Pei  =  (Q[^2yhiYi  ) .  The  decision 
statistics  for  the  Pre-RDD  is  given  by 
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The  correlation  matrix  for  Pre-RDD,  R ,  is  defined  in  section  5.3.  In  the  discussed  system,  it 
is  equal  to 
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.  Therefore,  (A.4)  is  equivalent  to 
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(A.  6) 


The  BER  formula  for  PreRAKELDP  is  given  by  equation  (5.35).  For  the  convenience 
for  comparison  with  other  methods,  here  we  only  consider  the  case  of  single  transmit 
antenna.  If  represent  the  BER  in  the  form  of  Pet  =  Q{2j2yhiZi  ),  then  the  decision  statistics  for 
PreRAKELDP  is 
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Compare  matrices  R,  and  R  ,  we  find  that  they  have  identical  and  real  diagonal 
elements.  Based  on  the  previously  mentioned  practical  constraints  on  the  channel  gain 
coefficients,  an  approximate  analysis  can  be  given  as  follows:  the  diagonal  elements  of  these 
two  matrices  are  close  to  one,  while  their  non-diagonal  elements  have  the  absolute  values 
much  smaller  than  one;  as  a  result,  an  and  bn  have  close  values  that  are  larger  than  one. 
Therefore,  an  is  usually  larger  than  bu  in  practical  fading  channels. 

Comparing  the  BER  of  the  three  linear  precoders  of  interest  is  equivalent  to  compare 
their  decision  statistics  X„  7,  and  Z„  for  /  =  1,  2.  In  the  special  case  that  the  instantaneous 

power  of  the  channel  gains  equals  unity,  i.e.,  c10  +|cn|  =|c20|  +  c21  =1,  obviously  the 

value  of  Xj  averaged  over  the  channel  gain  coefficients  of  user  i  is  smaller  than  that  of  Y,. 
More  accurate  comparison  of  X„  Y,  and  Z,  is  given  by  the  statistical  simulation  results. 
Without  loss  of  generality,  we  observe  the  values  of  Ai,  Y\  and  Z\  in  the  following  simulation 
example.  Taking  104  samples  of  multipath  Raleigh  fading  coefficient  for  each  channel  path, 
we  calculated  X\,  Y\  and  Z\  according  to  (A. 3),  (A. 6)  and  (A. 7).  Fig.  A.l  shows  the 
histograms  for  X\,  Y\  and  Z\  in  a  3-user  3-channel  paths/user  system.  It  is  observed  that  in 
most  cases,  X\  is  smaller  than  Y\  and  Z\. 

The  above  analysis  indicates  the  better  perfonnance  of  the  Pre-RDD  and 
PreRAKELDP  than  the  Lin.  RAKE  precoder. 
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Number  of  samples 


Figure  A.l  Histograms  for  the  decision  statistics  of  the  linear  decorrelating  precoding 
with  RAKE  receiver  (Lin.  RAKE),  Pre-RDD  and  PreRAKELDP. 
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